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PREFACE 



Classical vibration theory is concerned with the determination of 
the response of a given dynamical system to a prescribed input. These 
are called direct problems in vibration and powerful analytical and nu- 
merical methods are available nowadays for their solution. However, 
when one studies a phenomenon which is governed by the equations 
of classical dynamics, the application of the model to real life situa- 
tions often requires the knowledge of constitutive and/or geometrical 
parameters which in the direct formulation are considered as part of 
the data, whereas, in practice, they are not completely known or are 
inaccessible to direct measurements. Therefore, in several areas in 
applied science and technology, one has to deal with inverse problems 
in vibration, that is problems in which the roles of the unknowns and 
the data is reversed, at least in part. For example, one of the basic 
problems in the direct vibration theory - for infinitesimal undamped 
free vibrations - is the determination of the natural frequencies and 
normal modes of the vibrating body, assuming that the stiffness and 
mass coefficients are known. In the context of inverse theory, on the 
contrary, one is dealing with the construction of a model of a given 
type (i.e., a mass-spring system, a string, a beam) that has given 
eigenproperties. 

In addition to its applications, the study of inverse problems in 
vibration has also inherent mathematical interest, since the issues en- 
countered have remarkable features in terms of originality and tech- 
nical difficulty, when compared with the classical problems of direct 
vibration theory. In fact, inverse problems do not usually satisfy the 
Hadamard postulates of well-posedeness, also, in many cases, they 
are extremely non-linear, even if the direct problem is linear. In most 
cases, in order to overcome such obstacles, it is impossible to invoke 
all-purpose, ready made, theoretical procedures. Instead, it is neces- 
sary to single out a suitable approach and trade-off with the intrinsic 
ill-posedeness by using original ideas and a deep use of mathematical 
methods from various areas. Another specific and fundamental aspect 
of the study of inverse problems in vibration concerns the numerical 
treatment and the development of ad-hoc strategies for the treatment 
of ill-conditioned, linear and non-linear problems. Finally, when in- 
verse techniques are applied to the study of real problems, additional 



obstructions arise because of the complexity of mechanical modelling, 
the inadequacy of the analytical models used for the interpretation of 
the experiments, measurement errors and incompleteness of the field 
data. Therefore, of particular relevance for practical applications is 
to assess the robustness of the algorithms to measurement errors and 
to the accuracy of the analytical models used to describe the physical 
phenomenon. 

The purpose of the CISM course entitled "Dynamical Inverse Prob- 
lems: Theory and Application" , held in Udine on May 25-29 2009, 
was to present a state-of-the-art overview of the general aspects and 
practical applications of dynamic inverse methods, through the in- 
teraction of several topics, ranging from classical and advanced in- 
verse problems in vibration, isospectral systems, dynamic methods for 
structural identification, active vibration control and damage detec- 
tion, imaging shear stiffness in biological tissues, wave propagation, 
computational and experimental aspects relevant for engineering prob- 
lems. 

The course was addressed to PhD students and researchers in civil 
and mechanic engineering, applied mathematics, academic and indus- 
trial researchers. 

In the first chapter Gladwell discusses matrix inverse eigenvalue 
problems. He describes the classical inverse problems for in-line sys- 
tems, such as discrete models of rods in longitudinal vibration and 
beams in flexural vibration. He illustrates the theory governing the 
formation of isospectral systems, and describe how it may be used to 
construct isospectral finite- element models of membranes. Through- 
out the chapter, emphasis is placed on ways of choosing data that 
lead to a realistic system. Morassi in the second chapter describes 
some classical approaches to the inversion of continuous second-order 
systems. Attention is focused on uniqueness, stability and existence 
results for Sturm- Liouville differential operators given in canonical 
form on a finite interval. A uniqueness result for a fourth order Euler- 
Bernoulli operator of the bending vibration of a beam is also discussed. 
The next chapter by Rohrl presents a method to numerically solve 
the Sturm- Liouville inverse problem using a least squares approach 
based on eigenvalue data. The potential and the boundary conditions 
are estimated from two sequences of spectral data in several exam- 
ples. Theorems show why this approach works particularly well. An 



introduction to the Boundary Control method (BC-method) for solv- 
ing inverse problems is presented by Belishev in the fourth chapter. 
In particular, the one- dimensional version of the BC-method is used 
for two dynamical inverse problems. The first problem is to recover 
the potential in a Sturm- Liouville operator describing the transverse 
vibration of a semi-infinite taut string with constant linear mass den- 
sity by time-history measurements at the endpoint of the string. The 
second problem deals with a second-order vectorial dynamical system 
governing, for example, the longitudinal vibration of two semi- infinite 
connected beams having constant linear mass densities. An inverse 
problem is to recover the matrix coefficients of the lower order terms 
via time- domain measurements at the endpoint of the beam. Con- 
nections between the BC-method and asymptotic methods in PDEs, 
functional analysis, control and system theory, are especially investi- 
gated in this chapter. In the fifth chapter Vestroni and Pau introduce 
dynamic methods for dynamic characterization and damage identifi- 
cation of civil engineering structures. Indeterminacy and difficulties 
in modal identification and model updating are discussed with refer- 
ence to several experimental cases of masonry structures. A dam- 
age identification procedure in a steel arch with concentrate damage 
is also presented. Eigenvalue assignment problems in vibration are 
presented by Mottershead, Tehrani and Ram in the sixth chapter. 
Procedures are described for pole placement by passive modification 
and active control using measured receptances. The theoretical basis 
of the method is described and experimental implementation is ex- 
plained. The book ends with the lectures by Oberai and Barbone on 
inverse problems in biomechanical imaging. The authors briefly de- 
scribe the clinical relevance of these problems and how the measured 
data is acquired. Attention is focused on two distinct strategies for 
solving these problems. These include a direct approach that is fast 
but relies on the availability of complete interior displacement mea- 
surements. The other is an iterative approach that is computationally 
intensive but is able to handle incomplete interior data and is robust 
to noise. 

Graham M.L. Gladwell and Antonino Morassi 
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Matrix Inverse Eigenvalue Problems 



Graham M.L. Gladwell 

Department of Civil Engineering, 
University of Waterloo, Waterloo, Ontario, Canada 

Abstract The term matrix eigenvalue problems refers to the com- 
putation of the eigenvalues of a symmetric matrix. By contrast, the 
term inverse matrix eigenvalue problem refers to the construction 
of a symmetric matrix from its eigenvalues. While matrix eigen- 
value problems are well posed, inverse matrix eigenvalue problems 
are ill posed: there is an infinite family of symmetric matrices with 
given eigenvalues. This means that either some extra constraints 
must be imposed on the matrix, or some extra information must be 
supplied. These lectures cover four main areas: i) Classical inverse 
problems relating to the construction of a tridiagonal matrix from 
its eigenvalues and the first (or last) components of its eigenvectors, 
ii) Application of these results to the construction of simple in-line 
mass-spring systems, and a discussion of extensions of these results 
to systems with tree structure, iii) Isospectral systems (systems 
that all have the same eigenvalues) studied in the context of the 
QR algorithm, with special attention paid to the important con- 
cept of total positivity. iv) Introduction to the concept of Toda 
fiow, a particular isospectral fiow. 

1 Lecture 1. Classical Inverse Problems 

1.1 Introduction 

The word inverse, the related terms reverse, and mirror image, implicitly 
state that there is some 'object' that is related to another 'object' called its 
inverse and just as a physical object and its image in a mirror are related. 
Then talking about mirrors, we tend to label one object 'real' and the other 
'the reflection' suggesting tliat the 'reflection' is not 'real'. In our situation, 
dealing with mathematical objects that are 'problems', again we often label 
one problem as the 'real' one and the other, the inverse, as somehow not so 
real. Strictly speaking, we will just have two problems and can label one 
a 'direct' problem and the other the 'inverse' problem. Which problem is 
called direct and which inverse is often an accident of history: the direct 
one is the familiar one. 
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The inverse problems we will analyze arc all variants of so-called matrix 
inverse eigenvalue problems. For an exhaustive catalogue of such problems, 
see Chu and Golub (2005). See also Xu (1998) and Gladwell (2004). Because 
we are interested primarily in problems related to the undamped vibrations 
of elastic structures, we limit our attention to eigenvalue problems related to 
a real, square, symmetric matrix. We denote the set of real square matrices 
of order n by M„, the subset of symmetric matrices by Sn, and the set of 
real n x 1 column vectors by i?„. 

If A £ S'n, then a value of A for which there is a non-zero x e i?„ such 
that 

(A - AI)x = (1) 

is called an eigenvalue of A; x is called an eigenvector of A corresponding 
to A. It is known that A g Sn has exactly n real eigenvalues (A^)", not 
necessarily distinct, and n corresponding eigenvectors (x^)" that span the 
space Rn- If A; ^ Aj, then x^ and Xj are orthogonal: xfx^ = 0; if A, is 
an r-fold multiple eigenvalue, then we can construct r mutually orthogonal 
eigenvectors corresponding to A^. The eigenvectors (x^)" can be scaled so 
that they arc miit vectors, and then they are orthonormal: 

x^x -<5 -I 1 *=^' 
' ^ ~ '^ " \ z 7^ j ■ 

The matrix X e iV/„ with columns (x^)" is an orthogonal matrix: it satisfies 

X^X = XX^ = I. 

The set of orthogonal matrices in M„ is denoted by 0„. 

The direct eigenvalue problem for A € 5^ is this: Find the eigenvalues 
of A. It can be shown that this is a well-posed problem. 

The definition of a well-posed problem is due to Jacques Hadamard. His 
1902 definition has three elements: 

Existence: the problem has a solution 

Uniqueness: the problem has no more than one solution 

Continuity: the solution is a continuous function of the data 

For the direct problem, find the eigenvalues of A, we know that A has 
a unique set of eigenvalues, so that Properties 1 and 2 are satisfied. Since 
the eigenvalues appear as roots of a polynomial equation whose coefficients 
are additive and/or multiplicative combinations of the entries of A, they 
will be continuous functions of these entries: Property 3 holds; the problem 
is well-posed. 



Matrix Inverse Eigenvalue Problems 



If Q G On and A € S'„, then B = Q-^AQ will have the same eigenvalues 
as A: if (1) holds, then 

Q^(A-AI)x==0. 

But Q^Ax = Q^A(QQ^)x = B(Q^x) = By, where y = Q^x and thus 

(B - AI)y = 0. 

We can also show that all B e Sn with eigenvalues (Ai)" have the form 
B = Q^AQ, where A is one such matrix; we can always take A = 
diag{Xi, A2, . . . , A„) = A. Write N = n{n + l)/2. The orthogonal matrices 
Q form an {N — n) — n(n — l)/2-parameter family: the inverse problem 
^Find A £ Sn with eigenvalues (A^)"', is not well-posed. We note that 
A e 5„ has N independent entries, and there are only n pieces of data 
(Ai)": there is an {N — n)-paramcter family of solutions. 

1.2 The Rayleigh Quotient 

If X e i?„ and A & Sn, the ratio 



i?=^ 



^Ax 



is called the Rayleigh Quotient, after Rayleigh (1894). The primary result 
relating to the Rayleigh Quotient is that the stationary values of R for 
varying vectors x are the eigenvalues of A. 

To prove this, we note that R is homogeneous of degree in x. We 
can write R as the value of x^Ax subject to x-^x =1. We can enforce 
the constraint x^x =1 by introducing a Lagrange multiplier A, and then 
investigate the stationary values of 

R = x^Ax — Ax^x. 

Since 

- — = 2{aiiXi + 021X2 + • • • OniXn) - 2XXi 

the equations dR/dxi for i = 1, 2, . . . , n may be collected to give 

Ax Ax = 

which states that A is an eigenvalue of A. 

Now return to the need to provide more data for the inverse problem, 
and suppose that we know the eigenvalues (^i)i ^^ of the matrix A obtained 
from A by deleting row 1 and column 1. The /i.; are the stationary values of 
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R subject to the constraint xi — 0. Since xi = is equivalent to x-^ei = 0, 
where ei — {1, 0, . . . , 0}, we can introduce two Lagrange muhiphers A and 
2/^, and consider the stationary values of 

R ~ x^Ax - Ax^x - 2/ix^ei. 

Now 

— — = 2 2^ o,ij^j ~ 2Aa;i — 2^(5a = 

OXi 

which may be combined to give 

Ax — Ax — /.tei = 0. 
Since the (x^)" span i?„, we may write x — X^iLi '^i^i ^^ ^^sX 

n 

^(Aj - A)a,x, - //ei = 0. 

Using the orthonormality of the x^, we deduce 

(Aj - X)ai = A^xf ei = fixu 
where xu denotes the first component of Xi. So 

a^ = fiXii/{X, - A) 
and the constraint xi = gives the equation 

" r 12 

This means the eigenvalues (/Xj)"~^ are the roots of this equation. If each 
Xu is non-zero, the graph of /(A) is as shown in Figure 1. The eigenvalues 
{f^i)i~^ interlace the eigenvalues (A^)": 

Xl < Hi < X2 < ^J.2 < ■ ■ ■ Mn-l < ^n- (2) 

(If, for a certain i, xu ~ 0, then Xi itself is an eigenvalue of Ai.) This anal- 
ysis shows that if we know the (xi^)" (and since the values Xu, X2i, • ■ • , Xni 
form a unit vector, Y^^=i[^uY' = 1) then we can find the (/^i)"~^. But the 
converse is true: if we know the (/J,i)"~^, and they interlace the (A^)", then 
we can find the {xii)^. For then 

\p [xii? ^ nr=/(/"»-A) 
^A,-A n:=i(A.-A) 
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Figure 1. /(A) has roots /Xi,/X2, . 



from which we deduce 



[Xli 



n;=i(A,-A,) 



(3) 



where ' denotes j 7^ i. Wc can verify that if the fij interlace the Ai, as in 



(2), th 



en [xii]'^ given by (3) is positive. 



We conclude that if we know (/^i)"" then we know the first entries in the 
vectors (x,)". If we know the (Aj)" and the (xi,)", we have 7i+?i — 1 = 2?i — 1 
pieces of data. (Note that since X]"=i[^ii]^ = 1' ^^'^ (^ii)i provide only n—l 
pieces of data.) 



1.3 Inverse Problems for a Jacobian Matrix 

A Jacobian matrix is a matrix in Sn with just three diagonals: 

oi bi 

61 a2 62 



bn- 






Since changing the sign of one of the bi has no effect on the eigenvalues, 
we can assume that (6^)""^ > or (fo^)"^^ < 0. If bi = for some i = 
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1,2,..., n— 1, then the eigenvalue problem splits into two separate problems, 
one for j = 1, . . . , i and one for j — i + 1, . . . ,n. There is therefore no loss 
in generality in assuming (6i)"~^ ^ 0. 

If X is an eigenvector of J, then the first line of (J — AI)x = is (ai — 
A)a;i + biX2 = 0. If xi = then X2 ~ 0, and then line 2 gives 0:3 = 0, 
and so on until a;„ = 0: x = 0, eontradicting the statement that x is an 
eigenvector. We deduce that {xvifi ^ 0. Moreover, as we showed in Section 
1, if we know (A^)", and (/Xi)"~^ and they interlace, then we can find (xii)". 

We can assemble the n column-vector equations (A — Ail)xj; = into 
one matrix equation 

AX = XA 



where 



X = {xi,X2,.. .,x„}. 



We know the first entries in each vector x^, i.e., the first row of X. If we 
transpose X to form U, i.e., X"^ — U, then 

AU^ ~ U"^A or, on transposing, UA = AU. 

Now write U = {ui, U2, . . . ,u„}. Since the first column u^ contains the 
entries in the first row of X, we know Ui. Write UA = AU as 



{U1,U2,...,U„} 



Ol 


bi 








hi 


0'2 


b2 


hn- 


bn~l 

an 



A{ui,U2,...,u„}. (4) 



The first column of this equation is 

oiUi -I-61U2 = Aui. 
Since the columns Ui, U2, . . . , u„ arc orthonormal, we deduce 



(5) 



ai ~ Ui Aui. 



Now write (5) as 



61U2 = Au — oiUi = Z2. 
The vector Z2 is known; U2 is to be a unit vector, so that 



Z2 



bi 



Z2 



Matrix Inverse Eigenvalue Problems 



and 

U2 = Z2/61. 

Now take the second column of (4): 

61U1 + 02 U2 + &2U3 = AU2. 

Proceeding as before, we find 

02 = uf AU2 
62U3 = AU2 - 61U1 - a2U2 = Z3 

from which we deduce 

&2 = ||Z3||, U3=Z3/62. 

We continue this procedure to find (fli)" and (6^)"^^. This procedure is 
called the Lanczos algorithm. (Note that we have taken the bi to be positive. 
If instead the bi are negative, then 61 = —11^211, ^2 = — IksIL and so on.) 

Actually, what we have described is an inverse version of the original 
Lanczos algorithm. The original algorithm solved the following problem: 
Given a symmetric matrix A and a vector xi such that x^xi — 1, compute 
a Jacobi matrix J and an orthogonal matrix X = {xi, X2, . . . , x,i} such that 
A = XJX . In our use of the algorithm, we start with A = A. 

For variants of the Lanczos algorithm, and alternative ways to deduce 
the entries in a Jacobi matrix from eigenvalue data, see Gladwell (2004), 
especially Sections 4.2 and 4.3. See Golub (1973) for an early paper on this 
algorithm. 

It may be shown that the inverse problem of constructing J from data 
(Ai)" and (xi^)" is well conditioned, but some ways of carrying out the 
computation are better conditioned than others. This is discussed at length 
in the works just cited. See also Boley and Golub (1984) and Boley and 
Golub (1987). 

The inverse eigenvalue problem for a Jacobi matrix can be linked to a 
mechanical inverse problem: find the masses (Ti^)" and spring stiffnesses 
{ki)i in the system shown in Figure 2, so that the system has natural fre- 
quencies {uji)i and, when the right-hand end is fixed, has natural frequencies 
{u}*)i~^, where (cji)" and {ll;*)"~^ interlace. (Notice that for convenience, 
we have turned the system round so that the right-hand end is first free 
and then fixed: when considering A, we considered A and Ai, the matrix 
obtained by deleting row 1 and colmiin 1.) The equation for the natural 
frequencies (wi)i i^ 

(K-AM)u = 0, A = w^ 
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Figure 2. The mass-spring system lias its riglit-hand end free (above) or 
fixed (below). 



where 



K = 



fci + fc2 -^2 

-^2 /S2 + ^3 



f^n 



, M ~ diag{mi,m2, 



. ,m„j 



— k h 

To reduce the problem to standard form, write 

M = D-, D^KD^=A, Du = x 

so that 

(A - AI)x = 0. 



(6) 



Having found the Jacobi matrix A by using the Lanczos algorithm, we have 
to isolate K and M from A. Equation (6) shows that K = DAD. By 
inspection, K has the property 

K{l,l,...,l} = fciei 

where ei = {1, 0, . . . , 0}. Therefore, 

DAD{1,1,...,1} = fciei 

and hence 



A{di,d2, ■ • ■ ^dn} = D ^fciGi = d^ ^kiei. 



(7) 
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We now proceed as follows. Take the unique reconstructed Jacobi matrix 
A, (we choose the bi to be negative, just as the off-diagonal entries of K 
are negative) and solve 

A{xi,a;2,...,x„} = ei. 

We can show that the solution, x, is strictly positive: a;i,a;2, . . . ,a;„ are all 
positive. The solution of equation (6) is d = ex for some as yet unknown c. 
The total mass of the system is 

n n 

8=1 i=l 

Thus, choosing the total mass, and computing ||x|p, we can find c, and 
hence d, and hence the masses rrii = df. Now we compute K = DAD 
which yields the stiffness fc^. 

2 Lecture 2. Applications and Extensions 

2.1 A Minimal Mass Problem 

In Lecture 1, we showed that a Jacobi matrix J could be reconstructed 
uniquely from (2n — 1) pieces of data: the eigenvalues of J, and of Ji, the 
matrix obtained by deleting row 1 and column 1 of J. The eigenvalues must 
interlace in the sense 

Ai < /ii < A2 < . . . /-i„-i < A„. (8) 

An equivalent set of data is (A^)" and (a;ii)". Here xu is the first component 
of the ith eigenvector x^; the xu are normalized so that 



El 
1=1 



xu]^ = 1. 



In Lecture 1, we showed how this inversion procedure could be used to 
construct a spring mass system such as that shown in Figure 2. GladwcU 
(2006) used this analysis to solve a minimal mass problem: of all possible 
spring-mass systems as shown in Figure 2(a), which have the given spectrum 
(A,)", find the one that minimizes the mass m = X)i=i "^i ^'^^ given stiffness 
k given by 

k ^-^ k 

i—l * 
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Since the system can be reconstructed from (A^)" and the last entries Uni 
in the eigenvectors, the problem reduces to finding those u„,; that minimize 
m/k. 

A somewhat intricate analysis shows that 



m 



l[xA ■Y^ix^D^iu^.ry'-Y^xi^u^.r, o) 



\i=l 



where 



Di ~ TT '(Ai — \j), and ' denotes i ^ j. 

To find the global minimum of m/k, we note that (9) is a homogeneous 
function of degree zero in the u„i. To find its minimum, we use the Cauchy- 
Schwarz inequality ||x|p||y|p > |x • yp to deduce that the global minimum 
occurs when 

for some constant c. Thus 

K,)' = c2Arii?r2, [u^,]^^cXr'/^D-\ (10) 

and the minimum is 

Since the minimizing values of the u„i are known in terms of the eigenvalues 
data, (see (9)), the values of the individual masses rrii and stiffness ki may 
be found as in Section 1.3. 

This minimal mass problem is interesting because it shows that it is 
possible to set up an inverse eigenvalue problem for a spring-mass system 
for which the given data is one spectrum not two. 

Note that it is not possible to construct a Jacobi matrix from its eigenval- 
ues {Xi)i and one eigenvector x^ using a variant of the Lanczos algorithm. It 
is possible to construct a spring mass system which has a given mode shape 
for ith mode, but if this reconstruction is to be realistic, that is, to have 
positive masses and stiffnesses, then the ith mode shape u^ must satisfy cer- 
tain strict conditions on the sign properties of the sequence un, U2i, ■ ■ ■ , u^i- 
One condition is that uu ^ 0, Uni ^ as we found in Section 1.3. Also, if 
one Uji = then the neighbouring entries Uj-i^i and Mj+i.i must be strictly 
non-zero and have opposite signs. When these conditions are fulfilled then 
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we can give a precise definition for the number of sign changes in the se- 
quence uii, U2i, ■ ■ ■ , Uni', the ith mode must have exactly i — 1 sign changes. 
Thus, the first mode is positive, the second has one sign change, and so on. 
Not only must U; have this property, but also the w^ where w — E-^u. and 



E 



1 i 
1 i 



(11) 



where i = -1. Thus, wu = "li, W2i = U2i - "h, ■ ■ • ,Wni = Uni - u„-i,i. 
It can be shown (GladwcU (2004), p. 207) that if uijWij > 0, and the 
sequences (wij)"^]^ and {uij)"^i have exactly j — 1 sign changes, then there 
is a spring mass system of the form in Figure 2(a) that has {uij)f^i as its 
jth mode. It can be shown that there is a family of such systems. 

If two mode shapes are known, then providing that much more stringent 
conditions are satisfied, again it is possible to construct a spring-mass system 
which has those modes as eigenmodes; under certain conditions, this system 
is uniquely determined. See Vijay (1972) and Gladwell (2004). 

2.2 Another Spring- Mass Inverse Problem 

The inverse problem we have studied involves finding the stiffnesses (A:^)" 
and the masses (m^)" from two spectra (w,)" and {uj*)^. We note that 
multiplying all the stiffnesses and masses by a constant c does not alter 
the spectra, there are only 2n — 1 ratios to be found; we can fix one of the 
quantities (/c^, to^)", or fix, say, the total mass m = ^^ mi. It is tempting to 
suppose that an easier inverse problem would be to fix the masses (m^)", 
and find the n stiffnesses (fc^)" from one spectrum ((jJ^)". However, even 
for n = 2, it is not certain that there is a system with given mi, 7712, Ai, A2, 
(where Ai — ujf). The eigenvalues are the roots of 



ki + k2 — Ami — fc2 

— fc2 ^2 — A?n2 



= 0. 



If the eigenvalues Ai,A2 and the masses mi,m,2 are known, then fci 
(TOiTO2AiA2)/fc2, and /c2 is a root of the quadratic equation 



TOi + m2 

77117712 



hi - (Ai + A2)A:2 + TO2A1A2 = 0, 



12 



G.M.L. Gladwell 



which has real positive roots if and only if 

(Ai-A2)2>4^AiA2. 

This essentially states that the eigenvalues should not be too close to each 
other. For n = 3 the analysis becomes formidable, and essentially in- 
tractable: it is necessary to find the conditions on the data which will ensure 
that a certain sixth order polynomial equation has a real positive root. This 
problem is very similar to the so-called inverse additive eigenvalue problem: 
Given A G 5„ , find a diagonal matrix D such that A -|- D has eigenvalues 
(Ai)"; see Chu and Golub (2005), Friedland (1977). It is somewhat curi- 
ous that the inverse problem in which the 2n quantities (fc^, tti^)" are to be 
found is much simpler than that in which only (fcj)" are to be found. In 
a way, this should not be surprising because the quantity in equation (2) 
is (a multiple of) the response function of the system, the data (A^)" and 
(/Xi)" are the poles and zeros of this response function and it is known that 
the poles and zeros of the response function must interlace. Essentially, the 
system is reconstructed from its response function. 

2.3 Inverse Problems for a Pentadiagonal Matrix 

In Section 1.3, we showed that we could construct a Jacobian matrix A (a 
tridiagonal matrix) from the spectrum (A^)", making up A, and the first (or 
last) components of the normalized eigenvectors of A. We can easily extend 
the Lanczos algorithm to the reconstruction of a pentadiagonal matrix A 
from its eigenvalues, making up A, and the first two components of the 
normalized eigenvectors of A. Now equation (4) is replaced by 

Qi bi Ci 

h 02 62 C2 

ci 62 as ^3 C3 



[Ui,U2,...,U„J 



C„-2 

bn-1 
an 



A[Ui,U2,...,U„]. 



The first column of this equation is 

aiUi -I- biU2 + C1U3 = Aui. 
If Ui, U2 are known (and they must satisfy ufuj ~ 6ij, for i, j = 1, 2) then 



ai — Ui Aui, bi ~ U2 Aui 
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and 



and lluall — 1 gives 



C1U3 = Aui - aiui - 61U2 = Z3, 



Cl 



IZ3I 



U3 — Z3/C1, and so on. 



Formally, this analysis appears to be correct, but in practice, the matrix 
that is being reconstructed has a particular form, and in particular, has 
sign properties. 

A typical case in which this pcntadiagonal inverse problem arises is the 
inverse problem of a beam modelled as point masses attached by torsional 
springs to rigid rods. Now the stiffness matrix K has the form 



T, 



K = EL^EKE L E 



-IttT 



where 



K = diag{ki,k2, ■ ■ ■ ,fc„), L = diag{ii,i2, ■ ■ ■ ,(n) 



and E is given by (11). The eigenvalue problem is (K — AM)x = 0, which 
reduces to (A - AI)y = where A = B^KB^, and M = D^. The 
data must be chosen so that all the quantities (fc^, rrii, £,;)" are positive. The 
necessary conditions on the data, essentially making up Ui and U2, were 
found by Gladweh (1984), Gladwell (2004). Finding the conditions on the 
data is actually much harder than the formal reconstruction of the matrix 
A, and hence (fci,mi,£i)". 

2.4 Periodic Jacobi Matrices 

A Jacobian matrix is a tridiagonal matrix of the form (4). A periodic 
Jacobi matrix has the form 





ai 


bi 








bn 




bi 


0.2 


b2 








*iper 












bn-1 




bn 






bn- 


-1 


an 



It is called periodic because it is as if it were inscribed on a cylinder, and 
bn appears to the left of ai and to the right of a„. The graph underlying 
Jper is a ring on n vertices. 

Boley and Golub (1987) showed that Jp^r could be reconstructed from 
(Ai)", the eigenvalues of Jper', (mj)"^ , the eigenvalues of (Jper)i, and one 
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extra piece of data, /3 — bib2 ■ ■ -bn- Now, however, there are conditions on 
the data which wih ensure that there is a real solution; these were found by 
Xu (1998); see also Boley and Golub (1984) and Andrea and Berry (1992). 

2.5 Graph Theory 

The proper study of matrix form, i.e., the pattern of zero and non-zero 
entries in a matrix, requires a knowledge of graph theory. A graph ^ is a 
set of vertices, Pi, connected by edges, {Pi, Pj). The set of vertices is called 
the vertex set, and is denoted by V; the set of edges is called the edge set 
£. A graph is said to be simple if there is at most one edge connecting 
any two vertices. The graph is undirected if there is no preferred direction 
associated with an edge. With any symmetric matrix A we may associate 
an undirected graph; the rule is 



if i ^ j then (P„ Pj) e£ iff a^j ^ 0. (12) 



A matrix A € Sn is said to be irreducible if it cannot be reduced to the 
form 



B 



by any rearrangement of rows and columns. 

A graph Q is said to be connected if there is a chain consisting of a 
sequence of edges connecting any one vertex to any other vertex. It can 
be shown that A € Sn is irreducible if and only if its underlying graph is 
connected. 

A tree is a connected graph with no cycles. The simplest tree is a path, 
a set of vertices (P,)^' with n - 1 edges (Pi, P2), (^2, P3), • ■ • , (Pn-i, Pn)- A 
Jacobi matrix is associated with a path, while that associated with a periodic 
Jacobi matrix is a cycle with edges (Pi, P2), (-P2, ^3), ■ • ■ , {Pn-i,Pn), {Pn, Pi)- 

Duarte (1989), in an important paper, solved the inverse problem for a 
tree, i.e., for a matrix A € Sn whose underlying matrix is a tree. He showed 
that, given two interlacing spectra (A;)" and (/ii)"~^, it is possible to find a 
matrix A with underlying graph a tree, such that (A^)" and (/ii)"" are the 
spectra of A and Ai respectively. For an example, consider the tree shown 
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in Figure 3, for which A has the form 



A = 



XX X 

ry ry rf» ry ry 

tAj tij iA-r tAJ *X> 

X X 

X XX 

X X 

X XXX 

X X 

X XX 

X X 



Given (A^)" and (p-i)", we find aii,ai2,aig and we find sufficient new 




Figure 3. The graph underlying A is a tree. 



data to give a22, 023, 124, 025 on the one hand and age, 057 j oes on the other. 
Finally, we find agg, ogg. Further discussion of, and references to, eigenvalue 
problems related to trees may be found in Nylen and Uhlig (1994). 

The defining characteristic of a path is that the graph has an end; one 
can start at this end and proceed to the other end. For a tree, there is also 
an end, a root. One can start at this end and proceed to a vertex at which 
the tree splits into two or more subtrees; again one can proceed along each 
of these subtrees, and so finally reach each vertex. 

Unfortunately, the stiffness matrices obtained through a finite element 
model of a two- or three- dimensional solid structure are in general not trees 
- they are based on triangulation or some other similar process. There is no 
'end' of the underlying graph from which one can start. 

The simplest example of a matrix whose underlying graph is not a tree 
is a periodic Jacobi matrix. Just as the inverse problem for a periodic 
Jacobi matrix requires that rather curious extra piece of data, /3, so the 
inverse problem for a matrix with underlying triangulated graph requires 
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more data than just two spectra. Such inverse problems are currently still 
open. 

3 Lecture 3. Isospectral Systems 

3.1 Reversing Factors 

Two vibrating systems that have the same natural frequencies are said to 
be isospectral. Similarly, two matrices A, B G S'n with the same eigenvalues 
are said to be isospectral. 

In Lecture 1, we showed that if A e 5„ has eigenvalues (A,)" making 
up A = diag{Xi, X2, ■ ■ ■ , Xn), and normalized eigenvectors x^ making up 
X = {xi,X2,. .. ,x„}, then 

AX == XA, XX^ = X'^X = I, 

so that 

A = XAX"^. 

If B g S'n also has eigenvalues (A^)", then 

B = YAY'^, 

= YX'^(XAX'^)XY'^ = QAQ"^, 

where Q = YX^ is an orthogonal matrix (QQ'^ = YX^(XY^) = YY^ = 

I). 

Conversely, if B = QAQ^ and AX = XA, then 

B(QX) = QAQ^(QX) = QAX = (QX)A, 

so that B has the same eigenvalues as A. 

This shows that all the matrices B S S'„ that are isospectral to A are 
given by B = QAQ for some orthogonal matrix Q. 

However, if A has some special form, e.g., tridiagonal, pentadiagonal, 
then B = QAQ will share this form only for special orthogonal matrices 

Q 

To obtain a class of matrices B G Sn that are isospectral to A e 5„, we 
recall a simple but far-reaching result. 

// A,B S M„, then AB and BA have the same eigenvalues, except 
perhaps for zero. To prove this, suppose A 7^ is an eigenvalue of AB so 
that, for some x 7^ 0, ABx — Ax. Since A 7^ and x 7^ 0, we have Bx 7^ 0, 
and B(ABx) — ABx, i.e., BA(Bx) — ABx: A is an eigenvalue of BA with 
eigenvector Bx. Now reverse the roles of A and B to complete the proof. 
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To use this result to obtain a system that is isospectral to the system 
show in Figure 2(a), write K and M of equation (7) in the form 



K = EKE 



K 



M = D2, 



where 



F = diag{fi, f2., ■ ..,/„), D = diag{di,d2, ■ ■ . ,d„). 
Thus (K - AM)u = is reduced to (A - AI)x = 0, where 

A = D-^EF^E^D-i = (D-iEF)(FE^D~i). 

Now reverse the factors in A to form B: 

B = (FE'^D-1)(D-1EF). 

Thus matrix B is a Jacobi matrix; D"-'^ and F have been interchanged, as 
have E and E-'". It may be verified that the isospectral system related to 
B is a spring mass system free at the left-hand end and fixed at the right. 
This reversing of factors may be used in numerous contexts. 



3.2 The QR Algorithm 

If A e M„ is non-singular, then it can be factorized in the form 

A = QR, 

where Q is orthogonal, and R is upper triangular with positive diagonal. 
QR factorization is equivalent to applying Gram-Schmidt orthogonalization 
to the columns of A. If A = QR, then B = RQ = (Q'^Q)(RQ) = Q'^AQ. 
Thus A and B are isospectral, and if A G Sn, so is B. What is more 
important, however, is that A and B often have the same form. To see this, 
we note that A and B are linked by 



RA = BR. 

Suppose A is a Jacobi matrix, as in (4), then 



(13) 



ni ri2 ri3 
r33 



Tin 
r2n 



h 02 62 



bn-1 
bn-1 In 
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bii bi2 

^21 ^22 



bin 
b2n 



rii ri2 

?'22 



rin 

r2n 



Column 1 row 2 gives r22bi = ?'ii&2i; since rii,r22 are positive, 621 has the 
same sign as &i . If j > 3 then column 1 , row j gives 

= bjirii, 

which imphes bji ~ Q for j > 3, and since B is symmetric, bij — for j > 3. 
This argument may be continued to show that B is a Jacobi matrix. 

QR factorization and reversal may be given more power by introducing 
a shift, /i. Suppose /i is not an eigenvalue of A € S'„, and that 



and 
then 

and again 



A ^I = QR, 



B /il = RQ, 



(14) 



(15) 



B 



/il + RQ = Q^Q(/iI + RQ), 
= Q^(/iI + QR)Q = Q^AQ, 

RA = ^R + (B - /^I)R = BR. 



(16) 



We can use this result to obtain a new way of displaying all the Jacobi 
matrices isospcctral to a given one. First, we recall that we can reconstruct 
a Jacobi matrix from its eigenvalues and the first (or last) components Xu 
(or Xni) of the normalized eigenvectors x^. Since X^I'^i*]^ = ^j ^^ may 
picture a set of values (a;ii)i ^-s a point on the unit sphere in i?„. Each 
point gives a Jacobi matrix. 

Now consider the other way. Wc can consider the two step process (14), 
(15) as defining an operator t^^ which takes A into B: B = ^^^A. Suppose 
that A is a Jacobi matrix with positive off diagonal, then Q^ is commutative 
when applied to A, i.e., 

g,,,g^,A^g^,g^,A. (17) 

To prove this, suppose that u is a normalized eigenvector of A: 



Au = Au, 
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then 

BQ^u = (Q^AQ)Q^u = Q^Au = AQ^u, 

so that u* = Q-^u is a normalized eigenvector of B. We may express this 
eigenvector in another way. Since 

Au = (QR + /il)u = Au, 

we have 

QRu = (A — ^)u, 

or 

rp RU 



u* = Q' u = 



A — /i 
This equation shows that the last component of u* is 



" A-/i' 
and the last component of the eigenvector u* may be taken to be 

This shows that, under the operation tj^, the last components of the eigen- 
vectors are simply multiplied by two terms, one, r„„(/i), independent of i, 
and the other |Ai— /i|~^. This means that the last components of the normal- 
ized eigenvectors of either of the two matrices in (16) will be proportional 
to 



|Aj -^i||Aj -^2!' 

Since they are proportional, and the sum of squares of each set is unity, the 
two sets must be the same. But a Jacobi matrix is uniquely determined 
by its eigenvalues and the last components of its normalized eigenvectors. 
Therefore, (17) holds, and tj^ is commutative. 

We can now prove that if A, B are two isospectral Jacobi matrices with 
positive off-diagonal terms, then we can find a unique set (/ii)"" such that 

/il < /i2 < .. . < fln-l, 

and 

QiiiGt,2 ...t;^„_iA = B. 
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It is sufficient to show that wc can pass from one set of last components 
(uni)! to any other set (fni)i ^^ l*^ ~ 1) St^ operations. But equation (16) 
shows that this is equivalent to choosing (/ij)""^ such that 

n-l 

TT TT — - — r avni, i = l,2,...,n. 
This is equivalent to choosing the polynomial 



such that 



A^iJ 



\P{X.i)\ ^ Uni/Vni 1 = 1,2, ...,n. 

If we choose the /ij so that 

Ai < /ii < A2 < . . . < /^„-l < Xn, (19) 

then 

^(A.) = i-T^'Urn/Vn, i = 1, 2, . . . , n. 

But there is a unique such polynomial P{X) of degree n — 1 having values 
of opposite signs at n points Ai, and it will have roots fii satisfying (17). 

This means that, given A,B, we may pass from A to B in n — 1 steps 
Gfj, , and the order in which we take these steps is immaterial. 

We used the equation RA — BR to show that if A is a Jacobi matrix, 
then so is B. There is a more general result relating to a so-called staircase 
matrix. A symmetric staircase matrix has its non-zero entries clustered 
around the diagonal as in a staircase, as shown in Figure 4. It is easily 
shown that if A € 5'„ is a staircase matrix, and B = Qfj.A., then B is a 
staircase matrix with the same pattern. Arbenz and Golub (1995) showed 
that staircase patterns are effectively the only ones invariant under the 
symmetric (unshifted) QR algorithm. 

3.3 Positivity 

Not only do the matrices appearing in physical problems have a par- 
ticular pattern of zero and non-zero entries, but they also have particular 
patterns of positive and negative entries. We limit our discussion to sym- 
metric matrices A € Sn- 

The definition positive definite is well-known: A e S'„ is said to be 
positive definite PD (positive semi-definite PSD) if x-'^Ax > 0(> 0) for all 
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X X 








X 


X 



Figure 4. The graph underlying A is a tree. 



X £ i?". The stiffness matrix K of an (undamped) anchored system is PD; 
X"^Kx is related to the potential, or strain, energy of the system. It is 
well-known that x"^Ax > if and only if all the principal minors of A are 
positive. A principal minor of A is a minor taken from a set of columns and 
the same set of rows. A e Sn is PD if and only if its eigenvalues (A^)" are 
all positive. 

Now consider the concept of a positive matrix. If A G M„, then 

A > means that each entry in A is non-negative 
A > means A > and at least one entry is positive 
A >> means that each entry in A is positive. 

Let 

Z„ = {A : A e il/„; a,^ < 0, i^ j}. (20) 

If A e Z„, then it may be expressed in the form 

A = sl-B, s>0, B > 0. (21) 

The spectral radius of B € Af„ is 

p(B) — max{|A|; A is an eigenvalue of B}. 

A matrix A G i\/„ of the form (21) with s > p(B) is called an M -matrix] if 
s > /o(B) it is a non-singular A/-matrix. 

Berman and Plcmmons (1994) construct an inference tree of properties 
of non-singular Af-matrices; one of the most important properties is that 
A""'^ > 0; each entry in A~^ is non- negative and at least one is positive. 

A symmetric non-singular A/-inatrix is called a Stieltjes matrix, and 
importantly A (z SnD Z„ is a Stieltjes matrix iff it is PD. Moreover, if A 
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is an irreducible Stieltjes matrix then A^^ >> 0: each entry in A~^ is 
strictly positive. This result has important consequences for some physical 
systems. For a physical system, like an on-line spring mass system, or a 
finite element model of a membrane in out-of-plane movement, the relation 
between static displacement and applied force has the form 

Ku = f. 

For an anchored spring-mass system, the stiffness matrix is an irreducible 
non-singular i\/-matrix, so that K^^ >> 0. This agrees with our intuition 
that a positive force applied to any mass will displace each mass in the 
positive direction. Gladwell ct al. (2009) showed that the analogue of this 
result holds for a FE model of a membrane. 

For in-line spring-mass systems, and for the model of a beam in flexure, 
the stiffness matrix satisfies more stringent conditions which ensure that 
the eigenvalues arc distinct, that the first eigcnmodc has no node, that 
the second mode has one node, etc. The matrix property is so-called total 
positivity. The definitions are as follows, and they refer to A G M„, not 
just to A € S'„. A matrix A € Af„ is said to be 

TN (totally non-negative) if all its minors are non-negative^ 

NTN if it is non-singular and TN, 

TP (totally positive) if all its minors are positive, 

O (oscillatory) if it is TN, and a power A™ is TP. 

To these, we add one more: let Z = diag{+l, —1, -1-1, . . . , (— )"^^), then 
A = ZAZ is called the sign-reverse of A. 

SO (sign-oscillatory) if A is O. 

Clearly, if A e Sn is TP, then it is PD, but it can be shown that if 
A e 5„ is only NTN it is PD. 

If A € M„ then A is NTN, TP or O iff (A)-i is NTN, TP or O. 
Moreover, if A is SO, then A^^ is O, and vice versa. 

Let us apply these results. The stiffness matrix K of an in-line spring 
mass system is a PD Jacobi matrix with negative off-diagonals; its sign 
reverse K is a PD Jacobi matrix with positive off-diagonal entries. The 
principal minors of K arc positive, and it can be shown that its other minors 
are non-negative. If we form powers, K^, K"^, . . . , K""-'^, then K"^^ is full. 
It can be shown that K"^^ is PD, so that K is O, and K is SO, and K~^ 
is O. 

Total positivity is linked to staircase form through the result: If A g S'„ 
is NTN, it is a staircase matrix. 
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Gladwell (1998) proved that if A e S'„. if P denotes one of the properties 
NTN, O, TP, and if B is related to A by equations (14), (15) with ^ not an 
eigenvalue of A, then B has property P iff A has property P. This shows 
that these properties NTN, O, TP are invariant under QR factorization 
and reversal; in turn, this means that the minors of B will have the same 
sign properties as the minors of A. 

4 Lecture 4. Toda Flow 

4.1 The Basic Equations 

If A, B G M„, we denote the Lie bracket of A, B by 

[A,B] = AB~BA. 

We note that if A e 5„, and B e Kn, i.e., A is symmetric and B is 
skew-symmetric, then [A,B] e 5„, i.e., [A, B] is symmetric. 

We recall a basic result from Nanda (1985). 

Let S(t) € Kn be a skew symmetric matrix defined on — oo < t < oo, 
i.e., S"^ = — S. Let Q(i) be the solution of 

^ = -SQ, Q(0)=I, (22) 

then (i) Q(t) is an orthogonal matrix for — oo < t < cxj. 
Let A(t) be the solution of 

HA 

— = [A,S], A(0) = Aoe5, (23) 

then (ii) 

Ait) = Q(i)AoQ^(i), 

where Q is the solution of (22). 
Proof (i) 

= Q^S^Q-Q^SQ, 

= -Q^(S^ + S)Q = 0. 



Therefore 



Q^ (t)Q(t) == constant = Q' (O)Q(O) = I 



Q(t) is an orthogonal matrix, 
(ii) 

^(Q'^AQ) = -(Q'^S'^)AQ + Q'^(AS - SA)Q + Q^A(-SQ) 
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Hence Q'^AQ = const = Q'^(0)AoQ(0) = Aq, and 

A{t) = QAoQ^. 
The flow of A(t) according to (23), i.e., 

A = AS - SA, 



(24) 



wfiere S"^ = — S is an orthogonal matrix, is an isospectral flow, called a Toda 
Flow. The analysis had its beginning in the investigation of the so-called 
Toda lattice, Toda (1970). See Chu (1984) or GladweU (2004) for a review 
of the literature. 

Historically, Toda flow was applied first to a Jacobi matrix. Suppose 
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Then it can easily be verified that, as it flows, A remains a Jacobi matrix, 
i.e., if \i — j\ > 1 then hij = 0. Since a^ (0) = if |i — j| > 1 then aij{t) = 
for all t. The off-diagonal entries bi satisfy 



(a* 



H+l 



)h 



1,2, 



1, 



which implies that bi retains its sign: if &i(0) > then bi{i) > 0. 

We conclude that in this simple example, the matrix A retains both its 
zero/non-zero structure and its sign properties. We can therefore use Toda 
flow to construct spring-mass systems which vary over 'time' so that their 
eigenvalues remain the same. We use the Toda flow to find A{t), and then 
decompose A into M and K as described in Section 1.3. 
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There is a little-known Toda flow that keeps periodic Jaeobi form invari- 



ant: 



A = 
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We noted earlier that a periodie Jaeobi matrix is somehow like a Jaeobi 
matrix inseribed on a cylinder: 6„ appears to the left of Oi and to the right 
of a„. In S, the entry 6„ on the first row is treated as if it were in the lower 
triangle, and assigned negative sign; similarly the entry 6„ on the last row 
is treated as if it were in the upper triangle, and given a positive sign. 

We conclude that Toda flow of a periodic Jaeobi matrix is no more 
complicated than flow of a Jaeobi matrix; this contrasts with the difficulty 
of the algebraic solution to the inverse eigenvalue problem for the periodic 
Jaeobi matrix, as described in Section 2.4. 



4.2 Application of Toda Flow 

If A e Sn then we define A+ as the upper triangle of A. If S = 
A+ — A+-^, then S is skew symmetric. Now we can show that if A(0) 
is a staircase matrix, then so is A(t), the solution of (23), with the same 
staircase form as A(0). Let P denote one of the properties TN, NTN, TP, 
O, SO. If A(0) e Sn has property P then A(t), the solution of (23) with 
S = A+ — A^"'", has the same property P: all these properties are invariant 
under Toda flow. 

While these are interesting and important properties of Toda flow, they 
apply only to staircase matrices, and such matrices appear only when the 
physical system is in a certain sense an in-line system, such as a rod or a 
beam: its underlying graph is a generalized path. To find a skew symmetric 
S that ensures that, as it flows, A(i) lies on a certain graph, we must modify 
the concept of Toda flow. 

In Section 2.5, we defined the associated graph, ^(A), of A e 5„ : 
{Pi,Pj) e £ iff Uij 7^ 0. We need a somewhat looser term: the matrix 
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A € 5„ is said to lie on Q if aij ~ whenever [Pi, Pj) ^ £; Oi^- may be zero 
even when {Pi,Pj) € £. 

Let T € /C„ be defined by T = {tij) where 



tij 



If S e JCn, then T o S G 5„, where o denotes the Hadamard (entry wise) 
product. If S e K-n and T o S hes on tj, we will say that S lies on Q. 

Given a graph Q, we may form its complement Q; this is the graph on 
the same vertex set V{G), but is such that two vertices of G are adjacent 
ifi^ they arc not adjacent in Q. If A e 5„ and Q is a. graph with V{G) = 
{Pi, P2, . . . , Pn} then we may partition A in the form A = Ai + A2, where 
Ai lies on G and A2 lies on G- Thus, the subscript 1 denotes the restriction 
of A to G, and the subscript 2 denotes the restriction to G, the complement 
of G ■ For definitencss, we place the diagonal entries of A in Ai. Similarly, 
if S e /C„ we may partition S in the form S = Si + S2, where Si lies on G 
and S2 lies on G- 

Now return to equation (24). If A(0) lies on a graph G, then A(i), 
governed by (24), will not in general remain on G', we must choose S S JCn 
to constrain A(i) to remain on G- We may write equation (24) as 

Ai + A2 = [Ai + A2, S] = [Ai, S] + [A2, S]. (25) 

Now choose S so that [Ai, S] lies on G, i-C, 

[Ai,S]2 = 0. (26) 

Since A(0) is on G, so that A2(0) = 0, equations (25), (26) show that 

A2 = [A2,S]2, A2(0)=O 

which has the unique solution A2(t) = : A(t) is on G, i-c, A(t) = Ai(t), 
and 

Ai = [Ai,S]i. (27) 

We note that the equations governing the isospectral flow of Ai(t) have 
been reduced to the differential equation (27) on G, and the simultaneous 
linear algebraic equations, (26), i.e., 

[Ai,Si]2 + [Ai,S2]2-0, 
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for S on Q. In general, S will not lie on Q, i.e., S2 ^ 0. What we found 
earlier, for the Jacobi, and staircase matrices was that there was a solution, 
namely 

S = TOA 

which lay on Q. We found also that there was a solution S for a periodic 
Jacobi matrix which lay on Q. Gladwell and Rojo (2007) showed that the 
S for a generalized Jacobi matrix could be generalized to give S on C^ for a 
so-called periodic staircase matrix. 

In general, S will not lie on Q. In that case we take Si = TOAi, find 
S2 so that (26) holds, and then find Ai from equation (27). Gladwell 
et al. (2009) implement this procedure to find an isospectral flow of a finite 
element model of a membrane. 
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An Introduction to Classical Inverse 
Eigenvalue Problems 



Antonino Morassi 
Via Bortulin 9, Liariis di Ovaro (Ud), Italy 

Abstract Aim of these notes is to present an elementary intro- 
duction to classical inverse eigenvalue problems in one dimension. 
Attention is mainly focused on Sturm-Liouville differential opera- 
tors given in canonical form on a finite interval. A uniqueness result 
for a fourth order Euler-Bernoulli operator is also discussed. 

1 Introduction 

Eigenvalue inverse problems are primarily model problems simulating situ- 
ations of practical interest. Typically, in an inverse eigenvalue problem, one 
measures a certain number of natural frequencies of a vibrating system and 
tries to infer some physical properties of the system under observation. 

To fix ideas, consider a thin straight rod of length L, made by homoge- 
neous, isotropic linear elastic material with constant Young's modulus E, 
E > 0, and constant volume mass density 7, 7 > 0. The free vibrations of 
the rod are governed by the partial differential equation 

l{EAi^f-^)-lM^f-^-0, (.,t)e(0,L)x(0,oo), (1) 

where A ~ A{x) is the area of the cross section and w = w{x, t) is the 
longitudinal displacement of the cross section of abscissa x evaluated at the 
time t. The function A = A{x) is assumed to be regular (i.e., C^([0, L])) 
and strictly positive in [0, L\. The end cross-sections of the rod are assumed 
to be fixed, namely 

w{0,t)^O^w{L,t), t>0. (2) 

If the rod vibrates with frequency lu and spatial shape X = ^"(2;). i.e. 
w{x, t) = X(x) cos{Lut), then the free vibrations are governed by the bound- 
ary value problem 
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r iAix)X'{x)y + XAix)X{x) = 0, in (0, L), (3) 

\ ^(0) = = X{L), (4) 

where A = fw^ and X e C^{[0,L]) \ {0}. The (real) number A is called 
eigenvalue of the rod and the corresponding function X = X{x) is the 
eigenfunction associated to A. The pair {X,X{x)} is a Dirichlct cigenpair. 
For example, if A{x) = const, in [0, L], then A„ = (t^) ^-^id Xn{x) = 
;^sin^, n> 1. 

We find convenient to rewrite the problem (3)- (4) in the Sturm-Liouville 
canonical form by putting 

yix) = ^'A(x)Xix). (5) 

Then, the new eigenpair {X,y{x)} solves 

r y"(x) + Xy{x) = qix)y{x), in (0, 1), (g) 

j y(0) = = y(l), (7) 

where the potential q = q{x) is defined as 



9('^) "" AT^^' ™(0,1), (8) 

and where, to simplify the notation, wc have assumed L — 1. 

The direct eigenvalue problem consists in finding the eigenvalues {A„}5^]^ 
and eigenfunctions {yn(a;)}$^i of (6)-(7) for a potential q = q{x) given in 
[0, 1] or, equivalently, for a given cross section A = A{x) of the rod. 

Conversely, the inverse eigenvalue problem consists, for example, in find- 
ing information on the potential q = q(x) in [0, 1] given the Dirichlet spec- 
trum {A„}J^j of the rod. 

Inverse eigenvalue problems for Sturm-Liouville operators Ly = —y" + 
qy on a finite interval can be considered as a rather consolidated topic in 
the literature of dynamical inverse problems. Fundamental contributions 
were given in this field by Borg, Levinson, Krein, Gel'fand and Levitan, 
Marchenko, Hochstadt. Hald, McLaughlin, Trubowitz and others, see the 
books by Levitan (1987), Poschel and Trubowitz (1987) and Gladwell (2004) 
for a comprehensive presentation. 

Aim of these lectures is to present an elementary introduction to these 
inverse problems. In the first lecture, some general properties of the direct 
eigenvalue problem and asymptotic estimates of eigenvalues and eigenfunc- 
tions are recalled. The second lecture is devoted to the presentation of some 
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classical uniqueness results obtained by Borg (1946). The third lecture il- 
lustrates the approach by Hochstadt (1973) in the absence of data sufficient 
to ensure the uniqueness of the inverse problem, and a first stability result. 
A local existence result and a reconstruction procedure, still due to Borg 
(1946), are discussed in the fourth lecture. Finally, the fifth lecture is de- 
voted to an inverse problem for the Euler-BernouUi operator of the bending 
vibration of a beam given by Elcrat and Papanicolaou (1997). 

In all the lectures I plan to give the general ideas of the methods in- 
stead of the exact proofs. These were included in the notes delivered during 
the course and, in any case, can be tracked down from the original pa- 
pers cited in the references. The presentation of the arguments is almost 
self-contained and prerequisites are basic knowledge of functional analy- 
sis (Brezis (1986), Friedman (1982)) and complex analysis (Ahlfors (1984), 
Titchmarsh (1939)). 

In selecting the topics for presentation I had to take into account the 
limited time available for a rather extensive subject. I was forced to sacrifice 
some arguments (for example, the theory by Gel'fand and Levitan (1955), 
see also Levitan (1968)) that would have merited more attention, and to 
omit some other topics of current interest, such as, for example, isospectral 
systems (Poschel and Trubowitz (1987), Coleman and McLaughlin (1993) 
and Gladwell and Morassi (1995)) and inverse nodal problems (Hald and 
McLaughlin (1989)). 

2 Properties of the Dirichlet Eigenvalue Problem 

We begin by recalling some basic properties of the Dirichlet eigenvalue 
problem (6)-(7) for real-valued square summable potential q on (0,1), i.e. 

(zei2(o,i). 

i) There exists a sequence of Dirichlet eigenvalues {A„}J^]^. The eigen- 
values are real numbers and lim„_>oo Ki = -l-oo. 

ii) The eigenvalues are simple, that is 

Ai < Aa < ... < A„ < ... (9) 

and the eigenspace Un associated to the nth eigenvalue A„, n > 1, is 
given by 

U„ = span{.g„}, (10) 

where gn = g-nix) is the eigcnfunction associated to A„ satisfying the 
normalization condition L g'^{x)dx — 1. 
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iii) The family {gnj^i is an orthonormal basis of the space V of continuous 
functions which vanish at a; = and a; = 1, that is: 



1 







9ni^)9m{x)dx^S„^,^{ I '^1^2] n,m>l, ^^^^ 



and for every f (z V the series 

'cngn{x), with c„ = / f {x)g„{x)dx, (12) 



oo „i 










converges uniformly to / in [0, 1]. 
The above properties can be deduced from abstract spectral theory for self- 
adjoint compact operators defined on Hilbert spaces, see Brezis (1986) and 
Friedman (1982). An alternative approach is based on fmictional theoret- 
ical methods and specific properties of the Sturm-Liouville operators, see 
Titchmarsh (1962). 

As we will see in the next sections, the asymptotic behavior of eigenpairs 
plays an important role in inverse spectral theory. With reference to the 
Dirichlct problem (6)-(7), let us consider the solution y = y{x,X) to the 
initial value problem 

y"ix) + Xy{x) = q{x)yix), in (0, 1), (13) 

y{0) = 0, (14) 

2/'(0) = 1, (15) 

for some (possibly complex) number A and for a (possibly complex-valued) 
L^ potential q. By considering the right hand side as a forcing term, y can 
be interpreted as the displacement of an harmonic oscillator with frequency 
vA. Then, by the Duhamel's representation, the function y is the solution 
of the Volterra linear integral equation 

y{x, X) = ""^ ^^^ + 4f / sin%/A(a: - t)q{t)y{t)dt, x^[Q,l]. (16) 
V A V A JO 

It can be shown that there exists a unique solution belonging to C^([0, 1]) 
of (16) (y e C^([0, 1]) if g is continuous). Moreover, y is an entire function 
of order ^ in A. We recall that a function / = /(A) of the complex variable 
A is an analytic function if has the derivative whenever / is defined. If / 
is analytic in the whole plane, then / is said to be an entire function. Let 
M{r) = max|;^|^r 1/(^)1- An entire function / = /(A) has order s if s is 
the smallest number such that M{r) < exp(r''+'^) for any given e > 0, as 
r -^ oo. 
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By (16) one can also deduce that y{x, A) w ^'" J^^ for large |A|, precisely 



y(a;,A) 



sin ^/Xx 
auLC uiiau yy^, /\j 

sin vAx 



Ta 



> MLiexp(a;(||g|U. + |/myA|)), (17) 



uniformly in [0, 1], where ||<z||l2 = I J |(7(x)pda;] , v A = ReyX + ilmyX, 



10 

i = \/— T. The zeros of y(l, A) = are the eigenvalues {A„} of the Dirichlet 
problem (6)-(7) and, in this case, y = y{x, A„) is the associated eigenfunc- 
tion. By estimate (17), we expect that higher order zeros of y(l, A) = are 
close to the (square of) zeros of sinvA = 0, that is A„ w (titt)^ as n — > oo. 
More precisely, consider the circle C„ = {2 S C| \z — mrl = j} for n large 
enough. One can prove that there exists N G N such that for every n> N 
there exists exactly one zero of 2/(1, A) ~ inside the circle C„, namely the 
following eigenvalue asymptotic estimate holds 

|^A-H<^■ (18) 

The proof of (18) is based on a well-known result of complex analysis: the 
Rouche's Theorem, see Ahlfors (1984). Let / — f{z), g — g{z) be two 
analytic functions inside C„ and assume that |/(2)| < \g{z)\ on C„. Then, 
Rouche's Theorem states that g{z) and 17(2) + f{z) have exactly the same 
number of zeros inside C„. In order to apply this theorem, let us formally 
rewrite the function y(l. A) as 

,, ,, I , ^s sinvAa;\ sinyAa; ,,„. 

y{l,X)=[y{\) ^\+^^ (19) 




^ /(A) 
The delicate point consists in showing that |/(A)| < \g{X)\ on C„. Then, 

recalling that g{X) has exactly one zero (vAn — mr) inside C„. one obtains 

(18). 

Now, inserting the eigenvalue asymptotic estimate (18) in estimate (17) 

we get 

, , , sin \/A„a: ^ / 1 A /„„\ 

y(x,A„ = j== hO^ , as ri ^ 00. (20) 

Recalling that gn{x) — n, I'^^x )]l — ^^ obtain the asymptotic eigenfunction 
estimate 

gn{x) ^V2sm{7WTx) + o(-] , (21) 
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which holds uniformly on bounded subsets of [0,1] x i^(0, 1) as ti — > oo. 
Finally, by iterating the above procedure, the eigenvalue estimate (18) can 
be improved to obtain 

1 rl /I 



An = (m:) + / q(x)dx — / cos(2m:x)q(x)dx + O — , as n — > oo. 
Jo Jo \nj 

(22) 

3 Uniqueness Results: the Borg's Approach 

3.1 Symmetric Potential and Dirichlet Boundary Conditions 

Let us consider the Dirichlet eigenvalue problem 

r j/"(x) + Xy{x) = qix)y{x), in (0, 1), (23) 

j 2/(0) = = y(l), (24) 

where q £ i^(0, 1) is a real-valued potential. Denote by {gn{x,q),Xn}'^=^i, 
/p g^dx — 1, the eigenpairs of (23)-(24). Let us compare the high order 
eigenvalues of the above problem with those of the reference problem with 
q = 0: 

j z"ix) + xz{x)^0, in (0,1), (25) 

I z(G)=0 = z(l), (26) 

An easy calculation shows that Xn = (''^i')^! 97i{x,Q) = v2sin(n7ra;), n > 1. 
Then, by the asymptotic eigenvalue estimate (22) we obtain 

A„ = Xn + / q(x)dx — / cos(2mrx)q{x)dx + O ( — | , as n — > oo. (27) 
Jo Jo \nj 

Generally speaking, it turns out that knowledge of the high order eigenvalues 
can give information only on the average value of q and on the higher order 
Fourier coefficients of q evaluated on the family {cos{2mTx)}'^^i. Since the 
family {y/2 cos{2mTx)}^^i U {1} is an orthonormal basis of the space of the 
even functions with respect to x = 2 

LleniO, 1) = {./ e £'(0, 1)1 fix) = /(I - x) a.c. in (0, 1)}, (28) 

we expect to be able only to extract information from {A„}J^]^ for the even 
part of the potential q. 

These heuristic considerations were made rigorous in the following cele- 
brated theorem by Borg (1946). 
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Theorem 3.1 (Borg (1946)). Let q e Ll^,^,,^{0,l). The potential q is 
uniquely determined by the full Dirichlet spectrum, {\n\'^=i- 

The proof is by contradiction. Suppose that there exist another potential 
p G -Lg„g„(0, 1), p ^ q, such that the eigenvalue problem 

z"{x) + \z{x) = p{x)z{x), in (0, 1), (29) 

2(0) = = z{l), (30) 

has exactly the same eigenvalues of (23)-(24), i.e. \n{p) = ^n{q) for every 
n > 1. In order to simplify the notation, let us denote by gn{q) = gn{x,q), 
9n{p) = gn{x,p) the normalized Dirichlet eigenf unctions associated to A„ for 
potential q and p, respectively. Note that the nth Dirichlet eigenfunction 
is even when n is odd, and is odd when n is even. By multiplying the 
differential equation satisfied by gnil), ffn(p) by 5„(p), gnil), respectively, 
integrating by parts in (0, 1) and subtracting, we obtain 

/ (9 - P)gn{p)gn{q)dx = 0, for every 71 > 1. (31) 

By the asymptotic eigenvalue estimate (22) we have 

/ qdx ~ I pdx (32) 

and then condition (31) can be written as 

(9 - P)(l - 9n{p)gn{q))dx = 0, for every n>l. (33) 

Therefore, to find the contradiction it is enough to show that the family 
{1} U {1 — gniq)gn{p)}'^=i is a complete system of functions in Lg^,g„(0, 1). 
Actually, we shall prove that this family is a basis of Lg^g„(0, 1). We recall 
that a sequence of vectors {u„}J^i in a separable Hilbert space iJ is a basis 
for H if there exists a Hilbert space h of sequences a ~ {ai,a2, ■■■) such 
that the correspondence a — *■ X]n=i '^n'^n is a linear isomorphism between 
h and H (that is, an isomorphism which is continuous and has continuous 
inverse). 

Let us introduce the set of functions {C/„}J^o defined as 

r Uo{x)^l, (34) 

I Unix) = V2 i^J^ g„{q)gn{p)dx - g„(<7)g„(p) j , n > 1. (35) 
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Clearly, {t/n}J^g is a bounded subset of Lg^g„(0, 1). We will prove that 
{Un}'^=o is a basis of Lg„g„(0, 1). This immediately implies that also {1} U 
{gn{q)gn{p) — l}^i is a basis of i7g„g„(0, 1). At this point we make use of 
the following useful result. 

Theorem 3.2 (Poschel and Trubowitz (1987), Theorem 3 of Appendix D). 
Let {e„}„>o be an orthonormal basis of an Hilbert space H . Let {dn}n>o 
be a sequence of elements of H . Lf 

^) {dn}n=i:) «5 ■S'"c/l that Yln=0 ll^n ^ ^"lllf < +°0' 

and 
a) {dnjj^o '^'"s linear independent in H , 
then {dn}^o *■* ^ basis of H . 

We recall that a sequence {wn}^i in a separable Hilbert space H is 
linearly independent if X]^i Cn^n = for some sequence {c„}$^]^ with 
S^i '^n < °°7 then c„ = for all n. We apply Theorem 3.2 with H = 
iLe«(0,l), {e„}^=o = {V2cos(2n^a;)}^^i U {eo = 1}, d„ = t/„ for every 
n> 0. 

Condition i) is easily checked. By the asymptotic eigenfunction estimate 
(21) we have 



Un{x) = \/2cos(27i7ra;) + (-) , 



(36) 



T.\\^--Un\\h-f2o(^)<oo. (37) 



and therefore 



n>0 n=l ^ 

The proof of the linear independence stated in condition ii) is more diffi- 
cult. The original idea of Borg was to find a sequence of bounded functions 
{Vm}m=n C Let)en(Oj 1) such that {C/„, Vm}m.n=o ^^ ^ bi-orthonormal system 
of functions in Lg^g„(0, 1), that is 

J (t/„,K) = l, for every n>0, (38) 

I (t/„, V„i) ~ 0, for every m, n > 0, n ^ m, (39) 



where {UmVm) = /q Un{x)Vm{x)dx. Choose 



(40) 
(41) 



\ Vm{x) = a',n{x), m>\, 

with 

a„(x) =5m(x, g)Cm(2^,p), (42) 
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where Cm = Cmix,p) is a suitable solution of the differential equation 

C + A,„Cm=PCn, in (0,1). (43) 

Note that (^„i is not an eigenfunction of p since (^m does not necessarily 
satisfy the Dirichlet boundary conditions at x = and x = 1. 
By definition, we have 

iUo,Vo)^l, (C/o,y„) = 0, iUn,Vo)=0, n>l. (44) 

Assume m, n > 1. Since 

(C/„ , y,„) = -%/2(g„ (g)5„ (p) , a'J (45) 

and 

{9n{q)gn{p),a'r,i) = - ((.9n (<7)ffn (p))', Am) , (46) 

a direct computation shows that 



{{gn{q)gn{p))',am) = X 






dx. (47) 



If m 7^ n, m, n > 1, then by using the differential equation satisfied by 
gm{q) and Cm(p) we have 

(A,„ - Xn){{gn{q)gnip)y,a„i) = 

= idetf^rj'^i •^rJ'^M det f ^rS^! ')f\)\-l = o, m 

since gm{x,q) = gn{x,q) = at cc = and a; = 1. This means that 
([/„, Vm) = for m.n > 1 and rn 7^ n. 
Let m — n, m,n> 1. Recalling that 

the bi-orthonormality condition 
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is satisfied if and only if 

Ul,p)g'n{l,p) - Cil:P)9n{l,p) = V2. (51) 

The function C„ is not uniquely determined by the single condition (51). 
We must impose a second initial condition at x = ^. Recalling that for n 
odd the function g„(p) is even and g^(|) — 0, the function Cm(p) can be 
chosen such that 

UI,P)^0, Ciip)^ ^- (52) 

^ ^ 9n(2,P) 

Then, the function Cn{p) is odd with respect to a; = 1. Conversely, if n is 
even, gn{p) is odd and the function Cn(p) can be chosen such that 

U\,p)^^i^y C(;^,P)=0, (53) 

that is C,n{p) is even with respect to x = 1. In conclusion, for ri even, gn{p) 
is odd and Cn(p) is even, and then the function {gn{p)Cn{p)y is even. Simi- 
larly, when n is odd, gn{p) is even and Cn(p) is odd, and then the function 
{gn{p)C,n{p))' is still even, and the construction of the family {l^n}$^o is 
complete. 

3.2 Symmetric Potential and Neumann Boundary Conditions 

The method presented in the previous section can be extended to cover 
the case of Neumann boundary conditions. Consider the eigenvalue problem 

{ y"{x) + Xy{x)^q{x)y{x), in (0,1), (54) 

j y'(0) = = j/'(l), (55) 

with q € L^(0, 1) real-valued. Denote by {A„, g„}^Q the eigenpairs. Borg 
(1946) proved the following uniqueness result. 

Theorem 3.3 (Borg (1946)). Let q G L^„g„(0, 1). The potential q is 
uniquely determined by the reduced Neumann spectrum {A„} 



oo 
n=l- 



As in the Dirichlet case, the crucial point is to prove that the family 
{1} U {gn{q)gn{p)}^=i is a complete system of functions of ie^,e„(0, 1). 

It is worth noticing that the uniqueness result holds without the knowl- 
edge of the lower eigenvalue Aq. Actually, the lower eigenvalue plays a 
special role in this inverse problem, as it is shown by the following theorem. 
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Theorem 3.4 (Borg (1946)). Let q e L'^{0, 1) with /g^ q ^ 0. If the smallest 
eigenvalue Xq of the problem (54) -(55) is zero, then q = in (0, 1). 

In fact, let us denote by yo the eigenfunction associated to the smallest 
eigenvalue Aq. By oscillatory properties of Neumann eigcnfunctions, yo does 
not vanish in [0, 1]. Then, we can divide the differential equation (54) by j/o 
and integrate by parts in (0, 1): 

Jo Jo yo Jo \voJ \yoJ Jo \yoJ 

By (56) we get yo = in [0, 1] and then q = in (0, 1). 

It should be noted that the above uniqueness results can not be ex- 
tended, in general, to the Sturm-Liouville problem with even slightly differ- 
ent boundary conditions, for example 

r a2/(0) + y'(0) = 0, (57) 

j 72/(1) + 2/'(l)=0, (58) 

where a, 7 G M. Indeed, if a 7^ and 7 7^ 0, functions analogous to 
functions Un, Vn introduced in the proof of Theorem 3.1 are not necessar- 
ily symmetrical with respect to the mid-point x = ^. Borg (1946) gave a 
counterexample in which the eigenvalue problem with the boundary condi- 
tions of the type (57)- (58) does not lead to a complete set of functions in 

^eveni^^ !)■ 

3.3 Generic L^ Potential 

The uniqueness results addressed in the two preceding sections show that 
the set of functions {ffn (9)371 (p)}^0' where g-niq), g-nip) are the nth eigen- 
function corresponding either to Dirichlet or Neumann boundary conditions 
for potential q and p respectively, are complete in Lgj,g„(0, 1), that is in a 
space of functions which, roughly speaking, has half dimension of the whole 
space i^(0, 1). To deal with generic i^(0, l)-potentials, the idea by Borg 
(1946) was to associate to the original Sturm-Liouvillc problem another 
Sturm-Liouville problem such that the set of functions {ffn(<7)<7n(p)}$^o of 
the original problem together with the set of functions {5„('z)5„(p)}^o o^ 
the associated problem form a complete set of L^(0, 1). In particular, the 
boundary conditions of the associated problem are chosen so that they pro- 
duce an asymptotic spectral behavior sufficiently different from that of the 
initial problem. 

Let q e L^(0, 1) be a real- valued potential. As an example, we shall 
consider the Sturm-Liouville problem 
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V"{x) + \v{x)^q{x)y{x), in (0, 1), (59) 

2/(0) = 0, (60) 

2/(1) = (61) 

and its associate eigenvalue problem 

y"{x) + \y{x)^q{x)y{x), in (0, 1), (62) 

y{Q) = 0, (63) 

7F(l) + y'(l) = 0, (64) 

where 7 € M. Let us deiiote by {A„(g), y„((j) = 2/n(a;,g, A„(g))}^^o and by 
{A„(q),27„(g) = y„(x, g, A„(q))}^o t^e eigenpairs (with normalized eigen- 
functions) of the eigenvalue problems (59)-(61) and (62)-(64), respectively. 
Let us introduce the two companions Sturm-Liouville problems related re- 
spectively to (59)-(61) and (62)-(64) with potential p € L?{Q; 1), namely 

z"{x) + \z{x)^p{x)z{x), in (0,1), (65) 

z(0) = 0, (66) 

z{\) = (67) 

and its associate eigenvalue problem 

z"{x) + \z{x) = p{x)z{x), in (0, 1), (68) 

z(0) = 0, (69) 

7z(l) + z'(l)=0. (70) 

Let {A„(p),Z„(p) = Zn{x,p,Xn{p))}'^=Q, {A„(p),^„(p) = Zn{x,p,\n{p))}n=0 

be the eigenpairs (with normalized eigenfunctions) of the eigenvalue prob- 
lems (65)-(67) and (68)-(70), respectively. 

Borg (1946) proved the following uniqueness result by two spectra. 

Theorem 3.5 (Borg (1946)). Let q, p G L^ (0,1). Under the above notation, 
if Kiq) = Kip) and A„(p) = A„(g) for every m>0, then p^ q. 

The proof follows the same lines of the proof of Theorem 3.1. 

4 Uniqueness Towards Stability 

4.1 Hochstadt's Formula 

In previous section some uniqueness results for the Sturm-Liouville in- 
verse problem have been presented. We have seen that uniqueness for 
generic L^ potentials is guaranteed by the knowledge of two complete spec- 
tra associated to different sets of boundary conditions. This situation is 
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difficult to meet in practice. Here we shall consider a case in which two 
potentials share a full spectrum and differ only for a finite number of eigen- 
values of a second spectrum. Hochstadt (1973) proved that the difference 
between two potentials can be written explicitly in terms of certain solu- 
tions of the Sturm-Liouville operators. This result suggested an algorithm 
for solving the inverse problem with finite data and to achieve stability 
results, see Hochstadt (1977) and Hald (1978). 
Consider the Sturm-Liouville problem {L: a, f3) 

Lu = ~u" + qu ^ \u, in (0,1), (71) 

u(0)cosa-^M'(0)sina = 0, (72) 

u(l)cos/3 + w'(l)sin/3 = 0, (73) 

with a,/3 € M and where q € L^{0, 1) is a real- valued potential. Denote by 
{A,} the spectrum of (L; a, /3). 

Replace the boundary condition (73) with the new boundary condition 

M(l)cos7 + w'(l)sin7 = 0, (74) 

where 7 e M is such that sin(7 — /3) 7^ 0. Denote by {A^} the spectrum of 
(L;a,7). 

Consider now a second Sturm-Liouville problem {L;a, (3) with 

Lu = —u" + qu — Xu, (75) 

with q (z L^(0, 1). Denote by {Aj} and {A^} the spectrum of {L;a,l3) and 
(i;Q;,7), respectively. 
Assume that 

Xi ^ Xi, i e Aq, Aq == finite set of indices, (76) 

A, = A„ ieA = N\Ao (77) 



and 



Then 



A: = A^, for every L (78) 

q^Y.^y„w„Y, a.e. in(0,l), (79) 



Ao 

where y„ is a suitable solution of the differential equation 



y:: + (A„ - q)y„ = 0, in (0,1), (80) 
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and Wn is the nth eigcnfunction of {L; a, [3). If Aq = 0, than q ^ q a.c. in 
(0, 1) and we obtain the uniqueness for L^-potcntials from two spectra. 

Instead of giving the detailed proof of the Hochstadt's resuh, we wih 
see the main idea of the method and the key points of the analysis. Let us 
denote by w — w(x, X) the solution of the Cauchy problem 

w" + iX-q)w^O, in (0,1), (81) 

w(0) = sina, (82) 

w'(0) = -cosa, (83) 

and by w = w{x, A) be the solution of 

w" + {X-q}w^O, in (0,1), (84) 

w(G)=sina, (85) 

w'(0) = -cosa. (86) 

Observe that w„ — w{x,Xn) and w„ — w{x,Xn) are the nth eigenfunction 
of {L;a,f3) and {L;a,f3), respectively. Since we have no information of 
Xn G Aq, we try to characterize the relation between the cigenfunctions Wn 
and Wn for n e A. Let us introduce the subspaces of L^{0, 1) 

H={feL^iO,l)\ {f,w,)=0, zeAo}, (87) 

H={feL'{0,l)\ (/,mO=0, leAo} (88) 

and define the linear operator 

T : H -^ H (89) 

such that 

T{wn)=Wn, neA. (90) 

The operator T turns out to be bounded and invertiblc. The key point of 
the method is the characterization of the operator T. This can be done in 
two main steps: 
i) By the partial knowledge of the "first" spectrum {AijigA one can derive 
the identity 

(A-Z)T(A-L)-^ =r, for every A e C, A 7^ A,, i e A. (91) 

ii) By the full knowledge of the "second" spectrum {AJ}, the structure of 
the operator T can be determined. 
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Finally, the expression (79) follows by replacing the structure of T found in 
step ii) in (91). 

Concerning step i), to find (A — T)^^ we need to solve the equation 

(A - L)u = /, A e C \ {A„}„eN (92) 

for every f E H and under the boundary conditions (a,/3). By expressing 
/ by means of its eigenfvmction expansion / = J2a fnWn, fn = tHt^' '^^ 

II ^W 1^2 

the basis {w^} of 2.^(0, 1) (with {w„,Wk) = /„ w^Wk = \\w„\\l2Snk), we get 



= (A-L)-V = EyA-«'- (93) 

^ A-A„ 



Recalling that T is bounded we have 

A — A„ 



t(a-l)-V = Ey^^- (9^) 



A 



The right hand side of (94) belongs to the range of L and then, recalling 
that A„ = A„ for n G A, we have 

(A - Z)T(A - L)- V = Yl ^"^^f"'/^"^ = Y. /"^" = ^/' (95) 

A -^ ~ ^" A 

for every f G H, which implies the identity (91). 

Concerning step ii), we shall introduce an alternative representation of 
the operator T via the Green function of the problem (L;a,/3). Let v be 
the solution of 

( {\^L)v = v" + {\-q)v:^0, in (0,1), (96) 

?;(!) = - sin /3, (97) 

v'{l) = cos/3. (98) 

Then, for A ^ A„, the solution u = (X — L)^^f of 

(A - L)u = /, in (0, 1), (99) 

M(0)cosa + u'(0)sina = 0, (100) 

u(l)cos/3 + M'(l)sin/3 = (101) 

can be written in terms of the Green function G[x, y; A) as 

u{x,X)^ f G{x,y;X)f{y)dy, (102) 

Jo 
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where 

/ \ / \ ( w(x)v(y) ^ 

G(x,j/;A) = —— = <^ w{y)v[x) . (103) 

a;< = min(a;, y), x> = max(a::, j/), (104) 

w(A) = w(l, A) cos /? + u;'(l, A) sin/3 (105) 

and the function w has been defined in (81)-(83). The function lo = ti;(A) is 
an entire function of order 5 in A and its zeros are the eigenvalues of (i; a, /?). 
Since Wn = w{x^ An) and Vn = v{x, A„) are eigenfunctions associated to the 
(simple) eigenvalue A„ of {L;a,(3), we have 

knW„{x) = v„{x), (106) 

where fc„ is a no vanishing constant given by 

fc„ = -sin/3/w;„(l) if sin/3 ^0, fc„ = l/w7.;,(l) if sin/3 = 0. (107) 

Note that llu;,,!!^^ = ^^T^- 

By the asymptotic behavior of w{x, A), v{x, A) and a;(A) as A — > 00, it 
can be shown that the Green function G{x,y;X) can be expressed by the 
Mittlag-LefEer expansion 



Gix, y,X) = }^ „/Q,,UX_X,,V (10^) 



?i=0 



see Ahlfors (1984). Then, for every f E H wc have 

.(.,A) = (,,^)-r^,g fe»^n(.)/^^„(.)/(.)rf. ^ 
^ w'(A„)(A-A„) 

and applying T we get 

rrf\ T\-if-Sr^ Kwn{x) J^ f{y)wniy)dy + Wn{x) J^ f{y)vn{y)dy 
^^^^"^^ ^-■^ c.'(A„)(A-A„) ■ 

(110) 
Let Wn, Vn be the solutions of the initial value problems (81)-(83), (96)-(98), 

respectively, with q replaced by gand A„ replaced by A„. Then, as in (106), 
there exists fc„ 7^ such that k„Wnix) = Vn{x). 

Exactly on this point, wc make use of the second (full) spectrum (e.g., 
A'; = A' for every i) to proof the crucial fact 

kn = kn, for every n G A. (HI) 
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To prove this identity, let us introduce the frequency function 

z/(A) = it>(l)cos7 + it/(l)sin7 (112) 

for (L;a,7). The function v — v(X) is an entire function of order ^ in A 
and its zeros are the eigenvalues of the problem (L; a, 7). By a well-known 
property of the entire functions of order \ (see Ahlfors (1984), p. 208) we 
have 

KA) = an(l-v)' (113) 

where a is a suitable constant. (Without loss of generality, we have assumed 
A^ 7^ for every i.) Similarly, we can define the function v{X) for operator 
L under boundary conditions and (a, 7), respectively. We have 



v(\)^a\\\\-^\, (114) 




where a is a suitable constant. But, by assumption, A^ = A^ for every i and 
then, also recalling the asymptotic estimates of eigenvalues, we have 

v{\) = ^^(A). (115) 

Now, by evaluating lo(X) and v[X) at A = A„, we have 

J w„(l)cos/3 + <(l)sin/3 = 0, (H^) 

1 w„(l)cos7 + w^(l)sin7 = i/(A„). (H^) 

Solving the linear system with respect to if„(l) and recalling (107) we get 



^ ^ sin(7 - /3) 
Using the same procedure for the operator L we obtain 



(118) 



V\\n) 

and recalling (115) we finally have /.;„ ~ fc„ for every n e A. 

We are now in position to conclude the proof of Hochstadt's formula 
(79). By substituting (111) in (110) we get 

rri\ r ^-l f _ Y^ ^"(^) /o" f(y)'^niy)dy + Wnjx) /J f{y)vniy)dy 
T{X-L) f-X. a;'(A„)(A-A„) ' ^'^''> 
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The subtle point is now to realize that the right hand side of (120) is closely 
related to the Mittag-LeSler expansion of the function 

f ^ _ ^(^) So fiy)w{y)dv + w{x) J^ f{y)v{y)dy 
w(A) 

where v{x) is solution of (96)-(98) with q replaced by q, and w{x) is solution 
of (84)-(86). In fact, we can represent g{x) in Mittag-Lefher series distin- 
guishing between residues related to poles belonging to Aq and to poles 
belonging to A: 

f _ Y^ Unjx) J^ f{y)wn{y)dy + Znjx) J^ f{y)vn{y)dy 
^^""'-^ c.'(A„)(A-A„) + 

Ao 

\r^ ^n(^) /o" I{v)wn{y)dy + Wn{x) J^ f{y)vn{y)dy 
^^ c.'(A„)(A~A„) ' ^'""^ 

where, to simplify the notation, we define 

Un = v{x, A„), Zn = w{x, A„), n e Aq. (123) 

The second term on the right hand side of (121) coincides with an expression 
present on the right hand side of (120). Now, we replace the above equality 
inside the identity (91) obtaining 

,. 7x / / N x^Un{x)J^f{y)wn{y)dy + In{x)J^f{y)vn{y)dy\_ 
\ ^ w'(A„)(A-A„) J 

(124) 
for every f E H. After a simple but lengthly calculation and recalling the 
asymptotic estimates of the function involved as A — > oo, we obtain the 
characterization of the operator T 

T} = .f-\Y1 y"(^) / y^^n{y)I{y)dy, (125) 

where _ _ 

1 u^^{x)-K^^ (126) 

2 w'(A„) 

The thesis (79) follows by using the representation (125) in (91). 

The above result has been extended by Hochstadt (1973) to symmet- 
ric operators, that is even potentials and boundary conditions such that 
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(3 — —a. The proof follows the lines we had sketched above. In particu- 
lar, information on a second spectrum is replaced by the symmetry of the 
problem. 

Extensions to Sturm-Liouville operators of the form 

iu= --((pu')'-'?"), (127) 

r 

with suitable a priori assumptions on r, p and g, are also available Hochstadt 
(1975). In this case, it is quite clear that there is no hope to determine all 
the coefficients r, -p and q by the knowledge of two full spectra (associated 
to suitable boundary conditions). Hochstadt was able to prove that, under 
suitable assumptions, the uniqueness theorems still hold modulo a Liou- 
ville transformation. The corresponding problem for a taut string has been 
considered in Hochstadt (1976). 

4.2 A Local Stability Result 

In the preceding section it has been shown that the potentials associated 
with two Sturm-Liouville operators which have a spectrum in common and 
differ in a finite number of eigenvalues of a second spectrum are intimately 
linked by explicit expression (79). This result raises the question of the well 
posedness of the inverse problem. The question is to determine, for example, 
if a "small" perturbation of the finite set of eigenvalues Aq (which are not in 
common between the two operators) leads to a "small" perturbation of the 
unknown potential. A first affirmative answer to this question was given by 
Hochstadt (1977). 

Theorem 4.1 (Hochstadt (1977)). Lett > 0. Under the assumptions stated 
at the beginning of section 4-1 and by using the same notation, if 

|A, -A,|<e, ^GAo, (128) 

then 

\q{x) - q{x)\ < Ke, a.e.m (0,1), (129) 

where K > Q is a constant depending only on the data of the problem. 
By (79) we have 

q-q^^{yiWi)', a.e. in(0,l), (130) 



where 
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f w'/ + (X, - qix))uH ^ 0, in (0,1), 
{ Wi{0) = sin a, 
u;-(0) = —cos a 



(131) 
(132) 
(133) 



and 



u^{x)kM^^ (134) 

w'(Ai) 
In (134), the functions Ui and % are solutions to 

r]" + iX^-qix))7j^0, in (0,1), (135) 

with initial conditions respectively given by 

Zi{0) = sina, Zi(O) = — cosa, (136) 

u,(l) = -sin/3, u^(l) = cos/3. (137) 

The fimction uj = uj{X) is defined as 

uj{X) = w{l, X) cos/3 + w'{l, X) sin/3, (138) 

where ^(a;, A) is a solution of (131) with A^ replaced by A. Recall that 
u! = io{X) is an entire function of A of order | and that its zeros are the 
eigenvalues {Xi} of L under the boundary conditions (a,/3). 
The constant ki is given by 

where 

i/(A) = w{l, A) C0S7 + ?«'(1, A) sin7 (140) 

is an entire function of order 1/2 in A. The zeros of ly are the eigenvalues 
{A^} of L under the boundary conditions (a, 7). 
By the assumption (128) we have 

A, = A, + eC» with Id <1, (141) 

and the differential equation (135) can be written as 

V" + (A, - q{x))ii = -eC.77, in (0, 1), (142) 

Let us first consider the function z^, which satisfies the boundary condition 
(136). If Ci — Oi then Zj is an eigenfunction coinciding with Wi. Therefore, 
we try to find Zi in the form 

Zt = Wi + eFi. (143) 

Replacing this expression in (135)-(136), the perturbation r^ solves the ini- 
tial value problem 
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7'l + {\-q{x))n^-aw^ + e7,), in (0,1), (144) 

f,;(0)=0, (145) 

F^(0)=0. (146) 

Then, by well-known results (see Poschcl and Trubowitz (1987), Chapter 1, 
Theorem 2) we have 

n{x) = -G / {yi{t)y2{x) - y2{t)yiix))iw,it) + en{t))dt, (147) 

where {yi, j/2} is a set of fundamental solutions of the differential equation 
y" + {\ — q)V = in (0, 1) satisfying the initial conditions yi(0) = 2/2(0) — 1 
and j/i(0) = 2/2(0) = 0. The function ri{x) solves the Volterra equation (147) 
and it is of class C^ in (0, 1). It should be noticed that, in general, ri{x) 
depends also on e. There is no dependence if equation (147) is linearized by 
neglecting the term 0(e) in the right hand side of (147). 
Similarly, we define 

Ui^Vi + esi, (148) 

where s^ solves 

■ 7l + {K~q{x))s, = -av, + eli), in (0,1), (149) 

< s,(l)=0, (150) 

_ 5^1)= 0. (151) 

The functions u; and Wi are eigenfunctions of L (under the boundary con- 
ditions (a,/3)) associated to the same eigenvalue A;. Therefore 

Vi{x) = kiWi (152) 

with 

~ sin(7^^ (153) 

Since the two spectra {A^} and {A^} (respectively of L and L under the 
boundary conditions (01,7)) coincide, we have i^(A) = i^(A) and 

HK) - v{K)\ < W'{^,)\\X, - K\ < Ke, (154) 

where i?i € (Ai,Ai) and iiT is a suitable constant depending only on {AJ}. 
By (139) we have 

,_sin(7-/3)_ sin(7-/3) ^^^^^^^^ ^^^^^ 



^(A^) j.(A,) + z.'(i9,)(A,-A,) 
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where li is a suitable number. 

By using the above estimates in (134) and by (152) we obtain 

-/ N ^Ui{x) - kiZi{x) {vi{x) +esi{x)) - {k^ + eli){wi{x) +eri{x)) 
Vi^^l = 2 TTV~\ = 2 TTV~\ = 

Uj'(\i) Uj'[\i) 

(vi{x) - kiWi{x)) + esi{x) - ekiri{x) - eliWi{x) - e^liri{x) 

^ 0{e) (156) 



iO'{X^) 



Lu'iX,) + 0(e) 

and then 

mx)\<Ke. (157) 

By using similar arguments we find 

mx)\<Ke. (158) 

Finally, by (130) the thesis follows. The above results can be also extended 
to symmetric potentials. 

5 A Local Existence Result 

The first existence result for the inverse Sturm-Liouville problem is due to 
Borg (1946). Consider the differential equation 

y" + Xy = qy, in (0,1), (159) 

with 

qGL^(0,1), f q{x)dx = 0, (160) 

Jq 

and the two sets of boundary conditions 

y(0) := == y(l), (Dirichlet) (161) 

2/(0) = 0, 7j/(l)+2/'(l) = 0, (Mixed) (162) 

where 7 is a real number. 

It is well known that the potential q is uniquely determined by the knowl- 
edge of the spectrum {A„}^o of (159), (161) and of the spectrum {Hn}^=o 
of (159), (162) (see Theorem 3.5). Suppose to perturb the two spectra 
{A„},T=o and {Mn}~=o> and denote by {A^jjf^o, {mJ;}^=o the perturbed 
eigenvalues. We wonder if there exists a Sturm-Liouville operator of the 
form 

y" + X*y = q*y, in (0,1), (163) 
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with 



<?*eL^(0, 1), / q*{x)dx^O, (164) 



such that the spectrum of (163) under the boundary conditions (161) and 
(162) coincides with the given spectrum {A*}^0' iMnln^O' respectively. 
In other words, we assume that the values of {A*}^o ^^'^ {f^n}n'=Q ^^e 
given, and the question is whether there exists a potential q* £ i^(0, 1), 
/o q*{x)dx = 0, such that these two sequences arc the eigenvalues for the 
boundary conditions (161) and (162), respectively. 

Borg (1946) proved that if the sequences of eigenvalues {A* }J^q, {/^J^}^o 
are a sufhcicntly small perturbation of the spectra {A„}J^g, {/^nlj^o '^'^^~ 
responding to a Sturm-Liouville problem with potential q, then there exists 
a potential q* with the desired property. 

The solution of the existence problem will also indicate a way to con- 
struct the potential q* from q based on a successive approximation method. 
The result of existence has a local character because the perturbation of the 
two spectra must be sufficiently small. 

Theorem 5.1 (Borg (1946)). Under the above assumptions, there exists a 
number p > such that if 

OO C30 

ra=0 Ti=0 

then the sequences {A*}J^o; {/^nln^o '^^^ ^^'^ spectrum of the (unique) dif- 
ferential equation 

y" + X*y^q*y, zn(0,l), (166) 



with 



/■I 

-2 



q* e L^(0, 1), / q*{x)dx = 0. (167) 







Proof. Let us denote by 2; = z{x), z* — z*{x) the solutions of (159) (po- 
tential g), (166) (potential q*), respectively, and let us order the eigenval- 
ues and the corresponding eigenfunctions of the problems (159), (161) and 
(159), (162) as follows: the even index v — 2n ^ 2 corresponds to the nth 
eigenpair of (159), (161), while the odd index v — 2n + 1 \s for the nth 
eigenpair of (159), (162). Therefore, the eigenvalues {A„}5^q, {yU„}$^o ^-"^^ 
collected in a single sequence {\u}'^=i and the corresponding eigenfunctions 
are denoted by {zy}^^^. The eigenfunctions and eigenvalues of the problem 
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(166) will be ordered similarly, and will be denoted by {^,*}^i and {A*}^]^, 
respectively. Therefore, condition (165) is rewritten as 

oo 

Y.\Xu-K\'<p'- (168) 

Let us suppose for the moment that there exists a differential equation (166), 
namely that there exists a potential q* , with the desired properties and let 
us rewrite (166) in the form of perturbation of the reference equation (159) 

y" + {\~q)v = {\-X*+q* -q)y, in (0, 1), (169) 

where the perturbation is represented by the right hand side of (169). Note 
that the eigenfunction z* is a solution of (169) for A = A^ and A* = A*, 
z/ = l,2,.... 

Let Ci/ be a solution of (159) with A = A^ linearly independent of z^, 
(= i/th eigenfunction of (159) under boundary conditions (161) or (162)) 
chosen so that 



where 



D, = VA. if A, > 1, D, = l if K < 1, (170) 



^^='^^f SS cS '• ^'''^ 



Define 



K,{x,t) = Z,{x)(n{t) - Z,{t)Cr.{x), (172) 

$,(t)-A,-A:+q*(t)-q(t). (173) 

By (169), the function z* (=i/th eigenfunction of (166) under boundary 
conditions (161) or (162)) solves the Volterra equation 

ztix) = z,{x) + r ^^^^,{t)z:{t)dt. (174) 

By solving (174) with the method of the successive approximation, z* can 
be represented in the form 

oo 

z:{x) = z,{x) + Y.Ulr\x), (175) 

p=i 

where 

U^P^x) = K,o {K, o (... o K,{z,))) = KPoz,= 

= -r^K{x,Xl) j K{XI,X2)- I ...Z^{Xp)\\<^^{Xq)dXq (176) 
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and then 






^ 1 rx pxi pxp-i f 

^~nP / •■• / K^{x,Xl)-...-K^{Xp_l,Xp)z,,{Xp)Y\_^u{Xq)dXq. 

P=l ^^ Jo Jo Jo g^l 

(177) 



A direct calculation shows that 

»i 



z;z^<^^dx = 0. (178) 







By (177) and (178), for every :^ = 1, 2, ..., we get 



1 fi 



z^{x)A{x)dx + Si, / z^{x)dx+ 
Jo 

+ J2TiP / ••■ / QAxi,--,Xp+i)Y['^u{xq)dxq = 0, (179) 
p=i ^^ Jo Jo Jo ^^1 

where 

A{x)^q*{x)-qix), (180) 

<5, = A, - a:, (181) 

Q^ = z,,{xi)K{xi,X2) ■ ... • K^{xp,Xp+i)z^{xp+i). (182) 

Now, the theorem is proved if we are able to prove that there exists a 
function q* G L^(0, 1) that satisfies all the equations (179) and such that 
/(, q*{x)dx ~ 0. Note that the nonlinearity of the equations (179) is due to 
the terms $i/. 

The function q* can be determined by the successive approximation 
method. Suppose that the third term of (179) is zero (that is, we start 
assuming Aq = 0). Then, we need to find the fimction 

Ai{x) ^ ql{x) - q{x) (183) 

which satisfies 

/ zl{x)Aiix)dx = -5,, J. = 1,2,..., (184) 

Jo 

and 



/ Ai{x)dx^O, 1^ = 1,2,, 
Jo 



(185) 
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Here, the eigenfunctions Zi, arc normalized such that 

nl 

zl{x)dx^l, z/ = l,2,... . (186) 



Now, we recall that the sequence of functions {^,y}^i is a complete system 
of functions in £^(0, 1). We need also the following result. 

Theorem 5.2 (Borg (1946)). Let {cj,}^]^ he any subsequence of real num- 
bers such that 

oo 

^c2<oo. (187) 

Let {z^lJ^x ^^6 above sequence of functions. There exists a unique function 

/ e L2(0, 1) such that 

fi 

f{x)dx = (188) 



and 

c^^ j f{x)zl{x)dx, z/ = l,2, ... . (189) 

Jq 

Moreover, there exist two constants m and M independent of f , < m < 
M < CX3, such that 



oo -.1 oo 



m^^ci< I f'{x)dx<M'^ci. (190) 

By the assumption (168), the condition (187) is satisfied (with Ci, — —5^) 

oo oo 

5]<5^^5]|A.-A:p<p2<cx3 (191) 

and then, by Theorem 5.2, there exists a unique function Ai G L^(0,1) 
solution of (184), (185). 

Next approximation, A2(a;), can be determined similarly. In general, the 
function A,; (a;) solves for i^ = 1, 2, ... the equations 

1 nl 

zl{x)Ai{x)dx + S^ / zl{x)dx-\- 
Jo 

y^TiP / ■■■ / QAxl,■■■,Xp+l)T\'l>^,^-l{xg)dxg = 0, (192) 

P^^^l^JO Jo Jo g^l 
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where 

/ A,(x) = 0, (193) 

Jo 

$,,,_i(a:;)=(5, + A,_i(x), (194) 

9»*(a;) = 9(a;) - A,(a;), A^{x) ^ q*{x) - q{x), (195) 

A, eL2(0,l). (196) 

In order to prove the convergence of the iterative procedure we need to show 
that the sequence {A^J^q converges in L^{Q, 1). In the following we recall 
the main steps of the proof of convergence, see Borg (1946) for details. The 
proof is essentially in three main steps. First of all, one can prove that the 
sequence {Ai}^Q is bounded in L^(0, 1) for every choice of the eigenvalues 
{A*}^]^ provided that p is sufficiently small. More precisely, for every i 
there exists a constant A' > independent of i such that 

/ A^^{x)dx < K. (197) 

Jo 



/o 
This condition is satisfied if 



p< , (198) 



where 



if = l&slMHllie'^i - 1), (199) 

1 

(OO \ 2 

Y.jp] {D,^0{v), D,>1), (200) 

iaSiyi\Ki,{x„,x„j^i)\ < Mi, majc\zi,{xi)z^{xp+i)\ < Mi, (201) 

and A/ is the constant appearing in (190). 

The second step consists in proving that the sequence {A^J^q is a 
Cauchy sequence in L^(0, 1). Then, we have 



lim Ai(a;) = A(a;) e l2(0, 1), [ A 

I— OO Jo 



{x)dx = 0. (202) 



In the third step it is shown that the function q* found by the iterative 
process, e.g. q*{x) = q{x) — A(a;), actually solves the differential equation 
(166). Finally, by the uniqueness Theorem 3.5, the function q* is vmique 
and the proof is complete. D 
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Remark 5.3. Similar result also applies in the case of symmetric potential 
and when we know only one spectrum for given boundary conditions. Con- 
sider, for example, the differential equation y" + (A — q{x))y = in (0, 1), 
with q € iLeTi(Oi 1) ^i^d with spectrum {\n\'^=i under Dirichlet boundary 
conditions y(0) = = j/(l). It is possible to determine a differential equa- 
tion y" + (A* - q*{x))y = in (0, 1), with q* e Lf„en(0, 1) such that its full 
Dirichlet spectrum coincides with a given sequence of real numbers {A*}^i 
provided that 



E 



K-Xl,V<p\ (203) 



with p "sufficiently" small. The same argument can be also extended to 
Neumann boundary conditions y'(0) = = y'(l) under prescribed reduced 
spectra. 

Remark 5.4. Generally speaking. Theorem (5.1) is the analogous of the 
Riesz-Fischcr Theorem in the reconstruction of a function on the basis of 
its Fourier coefficients on a given basis of functions. However, the existence 
results for Sturm-Liouville operators based on eigenvalue information are 
valid only locally, namely if the distance p is small enough. This limitation 
seems not easy to remove and probably is due to the need to satisfy the 
inequality A„ < A„_|_i for every n, which have not correspondence among 
the Fourier coefficient representations. An attempt to use the local existence 
result to prove a global existence of the potential is sketched in Borg (1946). 



6 An Euler-Bernoulli Inverse Eigenvalue Problem 

Consider a thin straight uniform beam, of length L, resting on an elastic 
foundation of stiffness q = q{x) (per unit length), q{x) > 0. Let the beam 
be made by homogeneous, isotropic linear elastic material with constant 
Young's modulus E > and constant linear mass density p > 0. The free 
vibrations of the beam are governed by the partial differential equation 

EI^^^^^ + q{x)v{x,t)+p^^^^^ ^0, (x,f)e(0,i)x(0,(X3), (204) 

where / > is the moment of inertia of the cross-section and v ~ v{x,t) 
is the transversal displacement of the beam axis at the point of abscissa x 
evaluated at time t. The beam is supposed to be simply supported at the 
ends, namely 

V = 0, j—T =0. at x = and x = L, for every t > 0. (205) 
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Let us assume that the beam vibrates with frequency to, i.e. v{x,t) = 
u{x) cos(ujt). Then, the spatial free vibrations are governed by the boundary 
value problem 

Lu = u'^ + qu ^ Xu, in (0,1), (206) 

w(0) = = m(1), (207) 

w"(0) = = u"(l), (208) 

where X — uj'^ and where, to simplify the notation and without loss of gen- 
erality, we have assumed L = 1, EI — I, p — I. It is well known that for a 
given beam and a given potential q, q regular enough (e.g., q G C([0, 1]), q 
not necessarily positive), this eigenvalue problem has {A„,5„(a;)}^j eigen- 
pairs, with simple real eigenvalues such that lim„^oo ^n — oo. For example, 
ii q = 0, then A„ = (n7r)*, g„(x) — v2 sin(n7ra;) (/g g^{x)dx = 1), ?i > 1. 

A reasonable inverse eigenvalue problem consists in determining q from 
the eigenvalues {A„}^]^ of the beam. In this section we shall consider a 
uniqueness problem for symmetric q which is, in some respects, the fourth- 
order (local) counterpart of the Borg's Theorem 3.1 for Sturm-Liouville 
operators. The result has been obtained by Elcrat and Papanicolaou (1997) 
and it is stated in the following theorem. We refer to Gladwell (2004) for 
a general overview of the inverse problem for the vibrating Eulcr-Bernoulli 
beam. 

Theorem 6.1 (Elcrat and Papanicolaou (1997)). Let q = q{x) be a contin- 
uous potential in [0, 1] such that q{l — x) — q{x) and L q(x)dx — 0. There 
exists a constant Q, Q > Q, such that if \\q\\oo ~ max^-gjo^i] |'7(a;)| < Q, then 
the full spectrum {A„}^2^ determines uniquely q. 

The proof is by contradiction. Consider the eigenvalue problem 

Lu = u^'"' + qu^Xu, in (0,1), (2*^^) 

w(0) = = m(1), (210) 

w"(0) = = u"(l), (211) 

with ge C([0, 1]), /p g = 0, q{l-x) = q{x) in [0, 1], and let {A,i}^^i be the 
spectrum of L. We shall prove that, under the hypotheses of the theorem, 
if A„ = A„ for every n > 1, then q = qin [0, 1]. 

Denote by 5„, g„ the nth eigenfunction of L, L, respectively, with 
/o 9ndx = /o gidx = 1, n > 1. By multiplying by g„ (respectively, by 
Qn) the differential equation satisfied by Qn (respectively, by g„), integrat- 
ing by parts and recalling that A„ = A„ for every n > 1, we have 



/ (9 ^ (l)gngndx = 0, for every n > 1. 
Jo 



(212) 
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Therefore, it is enough to show that {.gnffnjj^i is a complete family of 
functions in Lg„g„(0, 1). 

Following Borg's approach, Elcrat and Papanicolau introduce the set of 
functions 

f Uoix)^l, (213) 

\ Unix) = \/2 i^J^ gnOndx - gn{x)]j„{x)^ , "- > 1. (214) 

Note that {C/„}^o i^ * bounded subset of Lg^,g„(0, 1), with J^ Undx — 
for every n > 1. By (212), we have 



/ (? ^ q)Undx = 0, for every 



n > 0, (215) 



and therefore the uniqueness result follows if {[/„}J^g is a complete family 
of functions in ig„g„(0, 1). Actually, it is possible to prove that {C/„}^o is 
a basis of Lg„g„(0, 1) for ||q||oo, Halloo small enough. 

Let {e„(x) = \/2cos(2n7ra;)}^2 U {1} = {enj^o ^e an orthonormal 
basis of Lg„g„(0, 1). Define the linear operator A : Lg„g„(0, 1) —> ig^g„(0, 1) 
as 

A{e„) = Un, n>0. (216) 

We prove that A is a linear boundedly invertible operator. By general 
properties, this result follows by showing that A can be written as A — I+T, 
where / is the identity operator and T is a compact operator with ||T|| < 1 
(see Friedman (1982)). 

At this point we need the following asymptotic estimates: 

\gn{x)^V2sm{mrx)\<C'-^\\q\\oo, as 7i ^ oo, (217) 

\g„{x) ~ V2sm{mrx)\ < C'-^\\q\\oo, as n -. oo, (218) 

where C, C are positive constants depending on Q and bounded as Q ^ 0, 
Elcrat and Papanicolaou (1997). Then, by (217), (218) we have 

lUJx) - V2cos(2ri7rx)| < C^^Q, as n ^ oo, (219) 

where C is a positive constant depending on Q. We also recall that, if 
q e ^([0, 1]), r = 0, 1,2,3, then the eigenvalues A„ of problem (206)-(208) 
have the following eigenvalue expansion 

|A„-(n7r)''| = (^V asn^oo, (220) 
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see Elcrat and Papanicolaou (1997). 

In order to prove that T is compact, it is enough to show that 

oo 

^||r«„||i. <oo, (221) 

71 = 

for some orthonormal basis {w„}^i of Lg„g„(0, 1), see Friedman (1982). By 
choosing u„ = e„, n > 0, we have 

re„IU. =0(^^) (222) 

and condition (221) is satisfied. 

Let / = X^n/n^n bc any element of L^„g„(0, 1), where /„ = (/,e„). 
By definition, the image of / under the operators A, T is given by Af = 
J2kiJ2n^knfn)ek, Tf = J2k iJ2n^k;Jn) Sk, respectively, where Akn = 
{Un,ek) and Tkn = Akn — Skn- In order to prove that 111^11^2 < 1, we first 
observe that 

oo 

\\nh<J2TL- (223) 

fc,n 

This inequality follows easily by the definition of the L^-norm of T and by 
Schwarz's inequality. Now, let us estimate the series on the right end side 
of (223). We have 

E T'n = E ( E ^'" - 1 - 2(^- - 1) j • (224) 



By Parseval's inequality and by using the asymptotic estimate (219), we 
have 

oo 111 

|^AL-l|<C7i±|^Q, (225) 

k 

where C is a positive constant depending on Q. Moreover, again by (219) 

we have 

/ In 77/ \ 

\Ann-l\<0{-^\Q, as 77^00. (226) 

Now, by inserting (225) and (226) in (224) we obtain 

f:n. = Y.o{^)Q. (227) 

k,n n ^ ^ 
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The series on the right end side converges and, therefore, the condition 
||r|||2 < 1 is satisfied, provided that Q is smah enough. Then, the operator 
A is invertible and the family {C/„}^o is a complete family of functions in 
Lg^g„(0, 1). This implies the uniqueness result stated in Theorem 6.1. 

We conclude this section by recalling an extension of the above unique- 
ness result for near-constant fourth-order operator with two unknown coef- 
ficients due to Schueller (2001). Consider the eigenvalue problem 

u^y - {p.u'Y + P2U = Xu, in (0,1), (228) 

u{0) = = u(l), (229) 

u"(0) = = u"(l), (230) 

where Pi, P2 & ^^(0, 1). Let 

-p = {{pi,p2) e (i^(0, 1))2 I (228) - (230) has only simple eigenvalues}, 

(231) 
Sym = {{pi,p2) e V I p^{x) = p,(l - x) a.e. in (0, 1)} (232) 

and for any (ai, 02) G Sym let us define 

B(ai,a2;e) = {(Pi,P2)e(L2(0,l))2|||pi-p2||L2 + ||p2-a2||L^<e}. (233) 

Theorem 6.2 (Schueller (2001)). Let (01,02) S Sym with ai. 02 constants. 
There exists e > such that if {pi,P2) € ^(ai,a2;e), (pi,p2) G S(ai,a2;e), 
with (pi,P2) € Syin and (j)i,p2) € Sym, have the same spectrum, then: 

i) P2 = P2 implies pi = pi; 
a) pi = pi and /q (p2 - P2) = implies P2 - P2 = 0. 

Extensions of the above results to other sets of boundary conditions (e.g., 
cantilever beam) or to multi-span beams are open problems. 
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A Least Squares Functional for Solving 
Inverse Sturm-Liouville Problems 



Norbert Rohrl 

Department of Mathematics and Physics, University of Stuttgart, Stuttgart, 

Germany 



Abstract We present a method to numerically solve the Sturm- 
Liouville inverse problem using least squares following (Rohrl, 2005, 
2006). We show its merits by computing potential and boundary 
conditions from two sequences of spectral data in several examples. 
Finally wc prove theorems which show why this approach works 
particularly well. 

1 Inverse Problems with Least Squares 

General inverse problems can be formulated in terms of least squares as 
follows. 

Let /pi,....p„ : M — > M be a function with parameters p = pi, . . . ,p- 
Givcn a set of measurements {{i, Xi, fi)}, find parameters which minimize 



n- 



G(p) = E(/pi.-.P"(^»)-/0' 



The function / is a model for the physical system which we are 
measuring. By minimizing the least squares sum, we choose the optimal 
set of parameters to reproduce the measurements with minimal error in the 
^^-norm. If the function G(p) is sufBciently nice, we can use a gradient 
minimization scheme to compute parameters po with small least squares 
error G(po). 

Note that in general there is no unique solution and there might even be 
no satisfactory solution, if the model / is not suitable. In these notes we will 
prove that this method indeed works for inverse Sturm-Liouville problems. 
But let us first think about what could go wrong in the general case. 
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1.1 Gradient Flow 

Numerical gradient minimization algorithms approximate the gradient 
flow p(i) of G given by the equation 



dp(t) 
dt 



^VG(p(t)) , 



where VG denotes the gradient of the least squares sum G. The gradient 
flow p(t) is a curve in parameter space, which always points in the direction 
of steepest descent. 

Let us use two one dimensional functions to show the problems we can 
run into when using gradient based minimization. First, there might be 
local minima: 

Consider the function G : M — )• R, x i->- sin ( j^^ j 





There are three cases when starting the minimization procedure at xq and 
following the gradient: 

• |a;o| < y^ leads to the local minimum at a; = 0, 

• |a;o| > v5 will lead to infinity, 

• only an xq in between will lead to one of the two global minima. 
Second, the speed of convergence can be very slow, if the neighborhood 

of the minimum is flat. 

Consider the function G : M ^ R, x H^ x" with gradient flow 



dp(t) 
dt 



-np{t) 



n~l 



For n ~ 2 this gives 

p(t) = poe-2* and G(p(t)) = pge'^* , 
For n > 2 we get a Bernoulli DE with solution 



n(2-n) , 



p(f) = (n(n - 2)t + Po )-— and G(p(<)) = p'^'^'"'{7j{n - 2)ty- 



This means convergence to zero in this case could get as slow as t ^ . 
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Now we have laid down the principles of our approach. In the following 
chapters we will apply these ideas to the inverse Sturm-Liouville problem. 

2 The Inverse Sturm-Liouville Problem 

We consider the Sturm-Liouville equation 

-u" + q{x)u = \u (1) 

on [0,1] with q{x) £ L'^{[0,1],M.) real, and separated boundary conditions 
given by two angles a, /? 

M(0)cosa + u'(0)sina = 0, u(l) cos/3 + u'(l) sin/3 = . (2) 

For each boundary condition (a/3) there exists an infinite sequence of 
eigenvalues Xq^(^ap),n: n G N (Titchmarsh, 1962). The inverse problem is 
to compute the potential q and the boundary conditions a, /3 from (partial) 
spectral data Xq,{ai3),n- 

Before we think about recovering a potential, we have to clarify existence 
and uniqueness of a potential which satisfies a given set of spectral data. 

We start with uniqueness, which tells us how much spectral information 
we need to determine the potential. The answer was first given by Borg 
(Borg, 1946) and generalized by Levinson (Levinson, 1949), and tells us, 
that we need two sequences of spectra. Fix three angles a, /3, and 7 with 
sin(/3 — 7) 7^ 0. Let Xq^i^n, i G {1, 2} denote the eigenvalues with respect to 
the boundary conditions (a/?) resp. (017). 

Theorem 2.1 ((Borg, 1946; Levinson, 1949)). Given potentials Q,q G 
L^[0,1] with 

XQ,i,n = Aq,i,„ for alln eN, i = 1, 2 , 

then Q ^ q. 

The existence question is for which input data there exists a potential 
with the given spectrum. In other words: for which sequences {Xg^i^n) there 
is a g with Xq^i^n = Ag^i.n? The answer is affirmative, if the following three 
conditions are met: 

• The two spectra interlace, i.e. Aq,i,„ < AQ,2,n < Aq_i,„+i for all ti G N 

• Both spectra satisfy the asymptotic formulas. 

• The boundary conditions fulfill 

1. sin a, sin /3, sin 7 7^ (Levitan, 1987) or 

2. a = /3 = 0,sin7 7^ (Dahlberg and Trubowitz, 1984) 
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Now we can formulate the least squares functional for the inverse Sturm- 
Liouvillc problem. Given boundary conditions (q;/3), (aj) and partial 
spectral data Xq^i^n with (i, n) in a finite set / C {1, 2} x N of an unknown 
potential Q, we define the functional 

(n,i)e/ 

for a trial potential q. We immediately see that G{q) is non-negative and 
zero iff Xq^i,n = Aq,i,„ for all (n, i) e /. 

To compute the gradient of G we need the derivative of the eigenvalues 
wrt. the potential. If gq^i^n denotes the eigenfunction of L^-norm 1 

corresponding to the eigenvalue Xq,i^n, then the derivative of Xq^^n in 
direction h is (Poschel and Trubowitz, 1987) 



d_ 

dq 

and therefore 



Jo 



— G[h]{q)^2 Y^ / {Xq^,^n-XQ^^,n)gli^nhdx. 



Hence we have the gradient 

VG(g) = 2 ^ {Xq.i^n - Aq^,,„) gli^^ . 

Later we will prove the following theorem, which immediately implies the 
absence of local minima. 

Theorem 2.2. The set of squared eigenfunctions 

{glJii,n)e{l,2}xN} 

is linearly independent in H^ . 

Corollary 2.3. The functional G has no local minima at q with G{q) > 0, 
i.e. 

— G[/i](g) = V/i ■^=^ VG(g) = 0^=^ G(q) = . 

Proof. Since the g^„ are linearly independent, the gradient VG(g) is zero if 
and only if all differences Xq^i^n — Xq^i^n vanish. This is obviously equivalent 
to G{q) = 0. , , , , ^ 



A Least Squares Functional 67 

Later we also show, that the algorithm converges exponentially when we 
start with a potential which is sufficiently close to a solution. The precise 
statement for the convergence of the gradient flow is: 



Theorem 2.4. If G{qo) is sufficiently small, there exists a constant d> 
such that 

G{qt) < G(go)e-* 
for all t > 0. 

2.1 Examples 

We will now give a number of examples first published in (Rohrl, 2005). 
The conjugate gradient algorithm can be summarized as follows (see e.g. 
(Press et al., 1992) for a practical introduction). 

1. Start with test potential go, z = 0. 

2. While G{qi) too big: 

a) Compute eigenvalues and eigenfunctions of qi . 

b) Compute G{qi) and VG{qi). 

c) Choose a direction hi using conjugate gradients. 

d) Find a minimum along the line G{qi + ahi). 

e) qi+i = qi + aghi 

In our examples we always start with qo ~ 0, use 30 pairs of eigenvalues, 
and count one loop of the conjugate gradient algorithm as one iteration. 
In the figures we use A2 — \\Q — q\\2 to denote the L^-distance of our 
approximation to the correct potential, itcr^ for the number of iterations, 
and Ax — max/j „\g/ \Xq^i.n — ^Q,i.n\ to measure how far the eigenvalues of 
our approximation differ from the given ones. 

First we use Dirichlct-Dirichlet and Dirichlet-Neumann boundary 
conditions (a = /3 = 0, 7 = 7r/2). The graph below is the result after 
5 iterations. Further iterations do not change the graph visibly. 
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G{q) 


= 


4.1589 
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0.182951 
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2.2881 


10 


iter# 
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5 






Below we plot the values of G{q) and the difference A2 ~ H? — QII2 of 
the L^-norms of our approximation and the true potential. The left plot is 
logarithmic and we can see the exponential convergence in the number of 
iterations. 
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Changing the boundary conditions to a ~ (3 — 7r/4, 7 — — 7r/4 does 
influence the speed of convergence significantly, as the following example 
shows. Except for the boundary conditions this everything is exactly as 
before. 
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G{q) = 5.71990 ■ lO'^ 

A2 = 0.283107 

Aa = 2.14052 ■ 10-3 

iter# = 80 




The solid graph is the approximation after 80 iterations and the dashed 
one after 620 steps. The final result is as good as before, we just need 
roughly 100 times as many iterations. In the convergence graphs below, we 
still see the exponential convergence of G{q) to zero. 




301 



675 




The reason for the slow convergence can be found in the asymptotics of 
the eigenfunctions. Remember that the gradient of an eigenvalue was given 
essentially by the square of the corresponding cigenfunction. In the case 
a — j3 — Q, ^ ~ 7r/2 we have 






1 - cos ((2n + 2)7ra;) + 0{n^^) 
1 + cos ((2n + IJTTx) + 0{n''^) 
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and therefore the squares of the n-th eigenfunctions are asymptotically 
orthogonal. This means that every eigenvalue has its own dimension and 
they can be independently adjusted by gradient descent. 
In the case sin a sin /3 sin 7 ^ we have 

aini^) = 1 + cos ((2n)7rx) + 0{n~^) 
9lnix) -= 1 + cos {{2n)Trx) + 0{n~') 



which says that the squares of the n-th eigenfunctions are asymptotically 
equal. So the two spectra have to fight each other for each n. 

For practical purposes it is also important, that we get good results with 
noisy data. In the first example, we add uniform noise to each eigenvalue, 
A = A + [/(-0.01,0.01). 



Giq) 
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The results are still almost as good as without noise. Increasing the 
noise by a factor of 10 still yields essentially the right solution. 
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Finally, let us look at a smooth example. 



8.1190 




3 Recovering boundary conditions 

The spectra also contain information about the boundary conditions. We 
now show that our method also can recover them. 

We exclude the case of Dirichlct [a = 0) boundary conditions, since they 
are a singular case in the continuum of all possible values. Therefore, we 
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choose a more convenient formulation of the boundary conditions, 

hau{0) + u'{0) = 0, hiu{l) + u'{l)^0, q -.^ iho,hi,h2,q) . 
The corresponding eigenvalues satisfy the asymptotic formula 

A„ = TT^Ti^ + 2{hi - ho) + / q{s) ds + a„ , 

Jo 

where (a„) e P. It can be shown, that two sequences of eigenvalues 
corresponding to boundary conditions (/iq/ii) and (^10/12) with hi 7^ h2 
uniquely determine the boundary conditions (Levitan, 1987). 

We use the following abbreviations to denote the potential including the 
boundary conditions 

qi := {ho,hi,q), q2 -^ {ho,h2,q) , q -.^ {ho,hi,h2,q) ■ 

Basically we just have to add the boundary information to our functional 
and compute the full gradient. This leads to (Isaacson and Trubowitz, 1983) 

Functional G{q) := ^ {>^q,i,n - >^Q,i,nf 

(~^\ ( -.9q,,.„(0) \ 

Gradient of eigenvalues VAq,i,„ 



dhi 

dh2 
dX. 






5q,i,„(l)^*,2 



Gradient 



As before, we can also in this case prove the absence of local minima 
and exponential convergence of the functional. 



3.1 Examples 

These examples were first published by the American Mathematical 
Society in (Rohrl, 2006). The first example tries to recover the potential Q 
with true boundary conditions Hq — 3, Hi = 3,H2 = from qo = and 
ho = 2,/ii = 4, /i2 = —1. The solid graph is the approximation after 150 
iterations and the dashed one after 900. 
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G(q) 


= 


8.6764 


10-5 




iter# 


I 


0.66227 
150 






be 


= 


(2.844; 


3.035; 


0.033) 




Although the approximation docs get better with more iterations, we see 
in the graphs below, that the boundary conditions do not change after 150 
iterations. 
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This observations leads to the idea, that we could set q to zero again, once 
the boundary conditions stabilize. Below we set q^^^^ = g^^^^ = g^^^o) = 
and do get a significantly better approximation in 350 iterations. 
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G{q) 


= 3.62063 ■ 10-"^ 


Aa 


= 0.404896 


iter# 


= 350 


be 


= (2.9322; 3.0285; 0.02742) 




If we start from /ig — Q,hi = 0, /i2 = and set q — Q a, couple of times, 
the algorithm still converges. 
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Noisy input data leads to instability. The solid graph after 237 iterations 
is actually a better approximation than the dashed after 389. We again set 
g = at z = 89 to find better solutions. 
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G{q) 


= 1.50128 ■ 10~* 




A2 


= 0.47144 




iter# 


= 237 




be 


= (2.9229; 3.0345; 0.03347) 

W \ 
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J \i /V\'vyi/M 
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Without setting q = 0, the approximation would deteriorate much faster. 
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G(q) = 1.39225 . 11)-^ 
i ' A2 = 0.669478 
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Finally, again the continuous example. 



G(g) 
— A2 

1 itcr# 
be 



1.3369 ■ 10" 
0.0387494 
1887 

(2.99; 3.00; 
3.99- 10"*) 




4 Theory 

In this section we want to give a quick idea of why this approach works so 
smoothly. 
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4.1 Manifolds 

A manifold is a space which is locally homeomorphic to an open subset 
of a Banach space. This means in finite dimensions, that it is locally 
homeomorphic to an open ball in R". For this short introduction we will 
only consider finite dimensional manifolds. 

Let / : R" — > R™ be a diffcrentiable function with derivative dfx ■ R" — > 
M'" for all a:; £ R". The derivative is a linear map and if it is onto, we can 
decompose 

M» = T, © iV, , 

where dfx\T^ = and d/a;|Ar^ : TV — J> R™ is a linear isomorphism. We call 
Tx the tangential and Nx the normal space at point a; £ M". 

Theorem 4.1. For a constant c £ R™ the set Mc = {x £ R"|/(a;) = c} is 
a submanifold o/M" if dfx is onto for all x £ Mc- 

Proof. Straight forward application of the implicit function theorem. See 
also Poschel and Trubowitz (1987) D 

Example 4.2. Let / : R^ ^ R^, (a;i,a;2,a;3) ^^ (A + A+xl\ ^ = (\ 
Then the derivative of /, 

dfx:^'^^\y^Axy, Ax^i^l' ^o' ^f 

is onto for all x £ Mc, since X3 = and xf + X2 = 1. The decomposition in 
tangential and normal space is 




N^ = 



Lemma 4.3. If q : [0,r] ^ R", q{0) £ Mc is a curve such that 
dq{t) 



dt 



£T^(t), Vt£[0,T] 



then q{t) £ Mc for all t £ [0, T] . In other words, if a curve starts on Mc 
and its derivative lies in the tangent space for all times, then the curve will 
stay in the manifold Mc- 
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Proof. This is just a straight forward apphcation of the chain rule: 
^ If U\\ Af ^g(^) n 



dt 



Thus / is constant on the curve q. 



dt 



n 



4.2 From the Wronskian to a Scalar Product 

The Wronskian is an important tool for dealing with second order linear 
differential equations. Since we want to work in a geometric setting, we 
write it in form of a scalar product. 

If / and h are two solutions of a second order linear differential equation, 
the Wronskian of / and h is given by 

Wif,h)^fh'-f'h. 

Integration by parts of which yields 

1 .1 

Wif, h) dx = /(l)/i(l) - /(0)/i(0) - 2 / hf dx . 

Jo 

This in turn is the scalar product of the vectors 

-2/7 




in]R2 xL^[0,l] 



4.3 Linear Independence of the Eigenfunctions 

Using this scalar product for both boundary conditions simultaneously, 
we can show the linear independence of the squared eigenfunctions. 



Let s, 



g,z,n ; ^q^i.n 



be initial value solutions of the Sturm-Liouville equation 



-u" + qu — Xq,. 



1, 



.(1) 



<,.,„(!) 



-^1: C'qinil) 



Using the gradient of the eigenvalues and the vectors Vq^i^n and C/g,i,n 



VA„ 



9q,^,nWK2 



1 *^q.i,n 



( ~9l,niO) \ 
-9q,^,ni^) 

\2d,gl,^^{x)J 
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*^ q,i,n 



^q^i,nCq^i,n \^ ) 

one can prove the following theorem (Isaacson and Trubowitz, 1983) 
Theorem 4.4. For all i,j e {1, 2} and ti, m e N 

(^^q,i,7i, Vqj^m) = , (^Xq^i^n, Uqj^m) = <5ij(5m„ (— 1)* (/l2 — hi) . 

This immediately implies the linear independence of the gradients of the 
eigenvalues. 

Theorem 4.5. The set {VAg^i.„|(z,n) G {1, 2} x N} is linearly independent 
Proof. Suppose 

{i,n) 

Since the scalar product is continuous we can pull the sum in front and get 
for all {j,m) 

{iji) 

= 22 '^hn{'^Xq,i,ni Uq^^m) = aj,m (-l)'' (/j2 " ^l) 

{i,n) 

D 

This allows us to see that all potentials which produce our finitely many 
eigenvalues lie on a manifold. To that end fix a finite set of eigenvalues 
{AQ,i,„|(i,n) G / C {1,2} X N} and the map 

/ : M^ X L2[0, 1] ^ rI^I , q = {ho, hi,h2,q) ^ (Aq,,,„)(»,n)G/ 
whose derivative 

d/q : R^ X L2[0, 1] ^ rI^I , q ^ ( / VA,,,,„qda:; ) 

V-'O / {i.n)el 

is onto everywhere. The set M = {q = {ho, hi,h2,q)\f{q) = {>^Q,i,n){i^n)i^i} 
therefore is a manifold and {VAq,i_„ | (z, n) g /} is a basis of the normal space 
at q. 
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4.4 Exponential Convergence 

At first we look at general properties of the gradient flow of an arbitrary 
functional G 

t - -«<" 

Suppose it exists on some set Kq ■ The change of the value of the functional 
in t is given by 

dt dq dt Jo ^^ ' dt 



f |VG(gt)l'dx = -||VGfe) 

"'0 



If we can now find a constant d > such that ||VG(g)||| > dG(q) in Kc, 
we get 

^^ < -dG{q) , hence G{qt) < G{qo)c-''' , 
dt 

if qt does not leave Kc- 

If we can moreover find a constant d such that |1VG((7)||2 < dG{q) on 
Kc- 

||VGfe)ll2 < dGiqt) < dG{qo)e-'" on Kc - 

A short calculation using the Holder inequality now bounds \\qt — qo\\2- 

ht-qoWl = I (J ^G{qs)ds] dx 
»1 pt pt 



= 111 VGfejVG(g,Jdsids2dx 

Jo JQ Jo 

< [ [ ||VG(fe)||2||VG(g,J||2d.sid.s2 
Jo Jo 

= (J\\S/G{q,)\\2dx] < (J ^dG{q^)dx 

< dG{qo)(j C-''^'^dx\ <d^G{q„) 



Finally, we have the estimate 



ht - qoWl < d^G{qo) (3) 



A Least Squares Functional 81 

and therefore the maximal distance travelled by the gradient flow is bounded 
linearly in G{qo). 

To show that the gradient flow and the two constants actually exist in 
our case, one can show that 

O^I|VG(g)||i 



G(q) 

is a weakly continuous function of q. Using that Kc ~ {^llklb < C} is 
weakly compact we can conclude, that on Kc there exist d, d as above. 

For a fixed C > Ijgolb the gradient flow can not leave Kc by (3) if G{qo) 
is small enough. In summary we get the following theorem. 

Theorem 4.6. If G{qo) is sufficiently small, there exists a d> such that 

G{qt) < G{qo)c-''' 

for all t > 0. 

4.5 Other spectral Data 

There are also uniqueness results when one sequence of eigenvalues Aq.„ 
and either one of 

1. pn = /q z^dx, where z„ is a solution of — u" + qu ~ Aq^„u,u(0) = 
-1, u'(0) = ho (Gel'fand and Levitan, 1955). 

2. ^q „ = In (^^^7-%^) (Isaacson and Trubowitz, 1983). 

3. Cn (Levinson, 1949) defined as follows: For the two initial value 
solutions u\,v\ with u\{0) = — l,u';^(0) = /iq, wa(1) = ^Ia'a(I) ~ ^1 
there is are are constants C„ with ux = CnV\ if A = Aq,,i, since then 
both solutions are multiples of the eigenfunction. 

4.6 Isospectral Manifolds 

Finally I want to sketch, what (Isaacson and Trubowitz, 1983) showed 
about the structure of the set of potentials which have one fixed sequence 
of eigenvalues. First they show, that the sequences Aq^„ and ^q,„ uniquely 
determine the potential including boundary conditions 

Theorem 4.7. The map C{— (ho,hi,q) i— > (Aq^„,/q.„) is one-to-one. 

Then they define the isospectral set and show that it is a manifold. 

Definition 4.8. 

M(p) = {q|Aq^„ = Ap,„V71 > 0} 
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Theorem 4.9. M{p) is a manifold. A basis of its normal space at point q 
is {VAq.nJTi G N} and a basis of the tangent space is {yq,n}. 

They also explicitly compute the flow 

dq„(t) 
dt 

and show that 



= -K(q„(t)) 



; _ J 'q,.(o).m ruy^n 

tq„(t),m — S , 

['q„(o),« + ^ m = n 
Therefore we have the nice geometrical theorem. 

Theorem 4.10. (/q n)neN *s a global coordinate system, for M(p) and thus 
the manifold is connected. 



5 Similar Algorithms 

Finally, we list a number of similar algorithms, since no one is optimal for 
all problems. 

• RundcU and Sacks (Rundell and Sacks, 1992) developed an elegant 
an efficient algorithm based on Gelfand-Levitan-Marchenko kernels. 
On it's downside it needs the mean L Q dx of the potential and the 
boundary conditions as additional inputs besides the two spectra. 

• Brown, Samko, Knowlcs, and Marietta (Brown et al., 2003) invented 
an algorithm using Levinson (Levinson. 1949) spectral data. It does 
not need the mean as a separate input, but it is unknown if it also 
can be used to recover the boundary conditions. 

• Lowe, Pilant, and Rundell (Lowe et al., 1992) use a finite basis ansatz, 
and solve the inverse problem by Newton's method without requiring 
the mean as input. 
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1 Introduction 

1.1 About the paper 

The boundary control method (BC-method) is an approach to inverse 
problems based on their relations to control theory. Originally, it was pro- 
posed for solving multidimensional problems (see [2]) and therefore such an 
approach cannot be simple: the BC-mcthod uses asymptotic methods in 
PDEs, functional analysis, control and system theory, etc. The aim of this 
lecture course is to provide possibly elementary and transparent introduc- 
tion to the BC-mcthod. Namely, we present its 1-dimensional version on 
example of two dynamical inverse problems. 

The first problem is to recover the coefficient q ~ q{x) in the string equa- 
tion putt — Uxx + qu ^ Q with p = const > on the semi-axis x > via 
the time-domain measurements at the endpoint a; = of the string. The 
problem is classical and well investigated, its versions were solved in 60-70's 
in the pioneer papers by Blagovestchenskii, Sondhi and Gopinath. So, this 
part of the course is of educational character; we solve a classical problem 
but by another technique: the main tools of the BC-method (amplitude for- 
mula, wave visualization, etc) arc introduced and applied for determination 
of q. 

The second problem is more difficult and richer in content: we deal with 
a vector dynamical system governed by the beam equation puu — u^x + 
Aux + Bu = in the semi-axis a; > with a constant diagonal 2x2-matrix 
p = diag {pi,p2}, < pi < /32 and variable 2x2-matrices A{x), B{x). 
Such an equation describes the wave processes in a system, where two dif- 
ferent wave modes occur and propagate with different velocities. The modes 
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interact; interaction provides interesting physical effects and in the mean 
time compUcates the picture of waves. An inverse problem is to recover the 
coefficients A, B \x>o via the time-domain measurements at the endpoint 
a; = of the beam. We solve this problem by the BC-method, the parallels 
to the scalar case (string equation) being clarified and made use of. In con- 
trast to the first part of the course, where we " solve a solved problem" , the 
results concerning to two-velocity dynamical systems are close to the state 
of the art in this area of inverse problems. 

1.2 Comment, notation, convention 

• About the style. We try to make the course available for the reader 
interested in applications. By this, we do not formulate lemmas and 
theorems but mainly derive formulas. However, every time we outline 
(or give a hint how to get) the rigorous proofs and provide the reader 
with proper references. 

• All functions in the paper are real. The following classes of functions 
are in use: 

the space C[a,b] of continuous functions and the space C''[a,b] of k 
times continuously differentiable functions; 

a Hilbert space ^2(0, 6) of square summablc functions provided with 
the standard inner product 



iy^'")L2ia,b) — / yis)v{s)ds 



and the norm ||2y||L2(a,6) := i.V-,v)l^(a,by 

the Sobolcv class H^ [a, b] of differentiable functions y with derivatives 

y' €L2{a,b). 

• Sometimes, to distinguish the time intervals a < t < (3 from the space 
ones a < a; < 6 we denote the space intervals by Jl'"''') :— (a, b) and 
put Q.'' := (0,a). 

• Convention All fvmctions depending on time t > are assumed to 
be extended to t < by zero. 

1.3 Acknowledgements 

The author is grateful to I.V.Kubyshkin for kind help in computer graph- 
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PART I: 1-CHANNEL DYNAMICAL SYSTEM 
(STRING) 



2 Forward problem 

2.1 Statement 

We deal with an initial boundary value problem of the form 

putt - Uxx + qu = {), x>0, 0<t<T (1) 

u\t^o = "t|t=o = 0, a; > (2) 

uU=o = f, 0<t<T, (3) 

where q = q{x) is a continuous function ( potential) on a: > 0; p > is a 
constant (density); the value t =: T < oo is referred to as a final moment; 
f = f{t) is a boundary control; u — u-^ {x,t) is a solution. The solution u-' 
is interpreted as a wave initiated by the control /, the wave propagating 
along the semi-axis (string) x > 0. We call (l)-(3) the problem 1. 

In parallel we consider the problem 1 

putt -Uxx = 0, x>0, 0<t<T (4) 

M|t=o = Ut\t=o = 0, x>0 (5) 

u\x=o = f, < t < T (6) 

and refer to (l)-(3) and (4)-(6) as perturbed and unperturbed problems re- 
spectively. The solution of the unperturbed problem is denoted by u-^ (x,t); 
it can be found explicitly ^: 

u/(.T,t) = /(t-^) , (7) 

where c := -i= is a wave velocity. 

We shall illustrate the considerations with the figures, which show con- 
trols and waves (at the final moment) on Fig. 1. 

2.2 Integral equation and generalized solutions 

The main tool for investigating the problem 1 is an integral equation, 
which is equivalent to (l)-(3). Seeking for the solution in the form u^ = 



Regarding the values of / for i — - < 0, recall the Convention accepted: f\t<0 = 0! 
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^ *-a; 



Figure 1. Controls and waves 



u' + w with a new unknown w = w' {x, t), we easily get 



pwtt - Wxx = -qw - qu^ J 
w|t=o = Wt|t=o = 0, 
wU=o = 0, 



a: > 0, < t < T (8) 

x>Q (9) 

0<t<T. (10) 



Applying the D'Alcmbcrt formula (see, e.g., [30]), we arrive at the equation 

w + Mw = -Muf x > 0, 0<i<r, (11) 

where an operator M acts by the rule 

1 



{Mv)[x,t) := 



2c 



q{€)w[^,ri)d^dri 



(12) 



Ka{x,t) 



and Kc{x,t) is the trapezium bounded by the characteristic lines t ± - = 

const . The equation (11) is a second-kind Volterra type equation; it can 

be studied by the standard iteration method (see, e.g., [30]). As can be 

shown, if the control / G C^[Q,T] satisfies /(O) = /'(O) = /"(O) = then 
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<^T 

Figure 2. The domain Kc{x, t) 



its solution w^ is twice continuously differentiable. As a result, the function 
w = uf + w^ turns out to be classical solution of the problem 1. 

As is easy to see, for any / e i2(0,T), the r.h.s. Mu^ of the equation 
(11) is a continuous function oi x,t vanishing as i < -. Simple analysis 
shows that the equation is uniquely solvable in this class of functions. In 
this case, we define the function u^ := u^ + w^ to be a generalized solution 
of the problem 1. So, for any control / G L2(0,r), such a solution u^ does 
exist and is unique. 



2.3 Fundamental solution 

Denote 



0, 



0<i<e 



t> £ 



and recall that 6^ — > 5{t) (the Dirac delta-function) as e — > in the sense 
of distributions. 

Putting / = (5e(i) in (11) we get the solution w^^^ belonging to the class 
C ([0,r]; ^2(0, 6)). In the mean time, letting e ^ we easily see that 
the r.h.s. of (11) Mu^' tends to Afu"^, the latter being a continuously 
differentiable function of x, i for < - < t < T and vanishing as - > i. 
Thus, even though the limit passage leads to a singular control / = 5(i), 
the solution w^ := lime_,o w*^^ does not leave the class C ([0, T]; £2(0, 6)). 
Simple analysis provides the following of its properties: 

1. w is continuously differentiable in the domain {{x,t) | < a; < 



cT, ^ <t<T} and w'' 



\t<^. 
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2. w^O,t) =0 for all t> 

3. the relation 



w' 



(x-O,-) ^ -^ q{s)ds x>0 (13) 



holds ^ and shows that w^ may have a jump at the characteristic line 
t = - (whereas below the line w^ = holds). We derive (13) later in 
Appendix I. 

In the unperturbed case oi q = 0, we easily have w = 0, whereas 

u\x,t) = ^(i--) (14) 

satisfies (4) in the sense of distributions and is called a fundamental solution 
of the problem 1. Analogously, the distribution u^ := u* + w* of the form 

u\x,t) = S(t~-) + w\x,t) (15) 

is said to be a fundamental solution of the (perturbed) problem 1. It 
describes the wave produced by the impulse control / = 6{t) (see Fig 3). 

Such a wave consists of the singular leading part d (t — |) propagating 
along the string with velocity c and the regular tail w {x,t), which may 
have a jump at its forward front. The presence of the tail is explained by 
interaction between the singular part and the potential. Also, note that the 
singular part in the perturbed and unperturbed cases is one and the same 



2.4 Properties of waves 

Return to the problem 1 and represent the control in the form of a 
convolution w.r.t. time: f{t) — {S * f){t) = J_ 6{t — s) f{s) ds. Since 
the potential in (1) does not depend on time, by the well-known linear 
superposition principle we have 

uf = u^*f ^ u^ ^ f ^ {u^ +w^)^ f = u^ * f + w^ * f 



Here w (x — 0, -) is understood as linisi^ w (s, — ) . 

Physicists comment on this fact as a principle: Sharp signals do not feel a potential. 
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6(T-^) 



^w^*- 




^cc 



Figure 3. Fundamental solution 



that implies the representation 



u^{x,t) — fit ") + / w^{x,t~s)f{s)ds — (see sec 2.3, property 1) 



= f(t--)+ I w\x,t-s)f{s)ds, x>0, t>0, (16) 



which is often called the Duhamcl's formula. The listed below properties of 
the waves easily follow from (16): 

• For any / G L2{0,T), the relation 



u^ |t<x =0, a;> 0, t > 



(17) 



holds, which is interpreted as the finiteness of the wave propagation 
speed. 



92 



M.I. Belishev 



• For / e 2.2(0, T) and r G [0, T], define a delayed eontrol fr £ ^2(0, T) 

by 

/,(i) := /(i~T), 0<i<T, 

where t is the value of delay (recall the Convention!). Since the po- 
tential does not depend on t, the delay of the control implies the same 
delay of the wave: the relation 



u-'^{x,t) :— u^ [x^t — t) 



(18) 



is valid. 
• Let a control / be a piece-wise continuous function, which has a jump 
at i = ^ (see Fig 4). By the properties of w^ , the integral term in (16) 
depends on x,i continuously. Therefore, uf{x,t) and f (t — ^) have 
one and the same jump at the characteristic line t — - -l-^, the values 
of the jumps being connected as 



u* {x,t) 



x=c{t-£,)+0 



= -/(.'^) 



x=c{t-^)-0 



s=i+0 



=i-o 



t> 0. 



(19) 



In particular, if / vanishes for < t < T — ^ and has a jump at 



u>,T) 




:(T-g "^T 

Figure 4. Propagation of jumps 



*-x 
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t — T — ^ then u^{- ,T) vanishes for x > c£,, the ampUtudes of the 
jumps being related through (19) as follows: 

M-''(cC-0,T) = /(T-^ + O) 0<^<T. (20) 

The equalities (19) and (20) is the simplest version of the relations, 
which are known in PDE-theory as geometrical optics formulas. Such 
formulas describe a propagation of the solution singularities in the 
wave processes governed by hyperbolic equations. 

2.5 Extended problem 1 and locality 

Return to the integral equation (11) for the regular part w^ of the 
fundamental solution. Representing in the form of the Neumann series 

oo 

w^ = ^ {—!)'' M''u^ and looking at Fig 2, we easily see that the values 

k=0 

w^{x,t) ior < X < ^, t <T- ^ (under the line t = T- ^ on Fig 5a) are 
determined by the values of the potential q lo<2;<^^ o^^ly (do not depend 
on the behavior of q in x > ^ ) . 

Such a dependence is an inherent feature of the wave processes with the 
finite speed of the wave propagation; it is known as a locality principle (see, 
e.g., [17]). It motivates to extend the problem 1 as follows: 

putt - u^x + gu = 0, (x, t) e A^^ (21) 

u|,<. = (22) 

u|.=o = /, 0<t<2T, (23) 

where A^^ := {{x,t)\0 < x < cT, < t < 2T - ^} (see the shaded domain 
on Fig 5b). Seeking for the solution in the form u-^ = f (t — -) + w^ , one 
can reduce the problem to the integral equation 

w + Mw = -Mii^ in A^^, (24) 

which is quite analogous to (11) and, for any / € ^2(0, 2T), is uniquely 
solvable in a relevant class of functions in A^-'". So, (21)-(23) is a well- 
posed problem, its solution being determined by q |o<2-<cT- 

Taking in (23) / = 5{t), one can construct the fundamental solution 

u\x,t)=5{t--\+w\x,t), (x,i)eA2^, (25) 
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Figure 5. Locality and extended problem 



and represent 



/(t-^)+ f w\x,t-s)fis)ds, ix,t)eA'^ 



(26) 



that extends the domain of definition of the Diihamcl representation (16). 

Take a smooth control / provided /(O) = 0. Differentiating in (26) w.r.t. 
X and putting a; = 0, we get 



ii{0,t) ^-^pf{t) + j r{t^s)f{s)ds, 0<i<2T, (27) 
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where r{t) := w!j(0,i) is caUed a reply function. It is a smooth function ^; 
it win play a central role in the inverse problem. 

Remark By the locality principle, the reply function r |o<4<2T is deter- 
mined by the part q |o<a:<cT of the potential. 

3 String as dynamical system 

3.1 System a^ 

Here problem (l)-(3) is considered in terms of the control theory and 
endowed with standard attributes of a dynamical system. The system is 
denoted by a^ . 

Outer space The Hilbert space of controls (inputs) T'^ := L2{0,T) with 
the inner product 

T 

{f,9)^T ^ J f{t)g{t)dt 



is said to be an outer space of the system a^ . It contains an increasing 
family of subspaces 

consisting of the delayed controls (T — ^ is the value of the delay, ^ is an 
action time). Note that J^^'O = {0} and J^^'^ = T'^ . 

Inner space Recall that by £7° we denote the interval (0, a) of the semi- 
axis X > 0. The space Ti^-^ := L2,p{^'^'^) with the inner product 



(m, w)^cr — / u{x)v{x)pdx 

is called an inner space of the system a^ . For each / G T^ , at any moment i, 
the wave u^{- ,t) is supported in il'^^ (see (17)) and, hence, can be regarded 
as a time-dependent element of TC^^ . In control theory, u^{- ,t) is referred 
to as a state of the system at the moment t. So, TC^^ is a space of states. 
The inner space contains an increasing family of subspaces 

n^^ := {y e n^^ I 2/U>ce = 0} , < e < T. 



The case r{+0) ^ is possible, i.e., the reply function may have a jump at i = only. 
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Control operator The input i-^ state correspondence of the system a^ 
is reahzed by a control operator W'^ : T^ — > Ti*^^, 

W'^f := u^{-,T) 

which creates the waves. Putting t := T in (16) and changing the variable 
in the integral, we get a representation 

cT 

{W^f)ix) = f(T-^) + Jw{x,s)f[T-^)ds, xen^^, (28) 

X 

where w{x, s) :— - w^{x, -). As is easy to see, W^ is a bounded operator. 
Recall that the delay of controls fr is defined in sec 2.4. Introduce the 
delay operation 13^' ^ : f >—> fr-^ as an operator in the outer space T'^ and 
note that D'^'^T'^ = J^^'«. Applying (18) for r = T - ^, f = T, we get 

W^D'^'if ^u^{-,0. (29) 

Since D^'^f € T'^'^, the wave u^{-,£^) is supported in fl'^^ (see (17) for 
t = £,), i.e., the inclusion u-^(-,^) £ H''^ holds. Hence, (29) implies the 
embedding 

w^T^'^f cn"^, 0<C<T. (30) 

As we shall see later in sec 3.3, "C" can be replaced by "=". 

Response operator An input i— > output correspondence in the system 
a^ is realized by a response operator R^ : T^ — > J-"^ defined on controls / 
of the Sobolev class 7Ji[0,T] provided /(O) ^ 0. 

{R^f){t) := u^(0,t), 0<t<T. 

In mechanics, u^(0,i) is interpreted as a value (at the moment t) of the 
force generated at the endpoint x = of the string by the wave process 
initiated by the control /. 

Differentiating in (16), taking into account the property 2 in sec 2.3, and 
recalling the definition of the reply function, we arrive at the representation 

t 

(R^f) it)^-l fit) + JwiiO,t~ s) f{s) ds = 



t 
-y^nt) + Jrit-s)fis)ds, 0<t<T. (31) 
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The presence of differentiation in (3f ) renders R^ to be an unbounded 
operator. Its adjoint (R^) is also unbounded. As is easy to show, it is 
defined on the controls / of the Sobolev class H^[0,T] provided /(T) = 
and acts by the rule 

T 

((it;^)V)W = \/p/'W+/rGs-t)/(s)ds, 0<t<T. (32) 



Return to the extended problem 1 and associate with it the extended 
response operator R'^^ acting in the space J-^^ := L2{0,2T) on controls / 
of the class H^[0, 2T] provided /(O) = by the rule 

(i?2^/)(i) :- u/(0,i), 0<i<2T, 

where u-^ is the solution of (21) -(23). Quite analogously to (31), it can be 
represented in the form 

t 
{R^^f){t) ^ -^f'{t)+Jrit-s)fis)ds, 0<t<2T. (33) 



The extended response operator is one more intrinsic object of the system 
a^ . By the locality principle, R^'^ is determined by the part q Jj^ct of 
the potential ^. 

Connecting Operator Since the control operator W^ acts from a Hilbert 
space !F^ to a Hilbert space H'^^ , its adjoint (W^) acts from H'^^ to J^'^ . 
By this, an operator 

is well defined. It acts in the outer space J^ and is said to be a connecting 
operator of the system a . By continuity of W , the operator C is also 
continuous. By the definition, one has 

= {u^i;T),u3{;T))^^^ , (34) 

i.e., C^ connects the metrics of the outer and inner spaces. 
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The connecting operator plays a key role in the BC-method owing to the 
following remarkable fact: it can be represented via the response operator 
in explicit and simple form. To formulate the result we need to introduce 
auxiliary operators: 

• The operator S'^ : T^ ^ T'^'^ , 

^ '^' |-/(2r-i), T<t<2T 

extending the controls from (0,r) to (0, 2r) by oddness w.r.t. t = T. 
As is easy to check, its adjoint ("S*"^) : T^^ -^ JF^ acts by the rule 

((5^)*.g)(i) =5(t)-.9(2T-t), 0<i<r. (35) 

• The integration operator J^'^ : JF^-^ -^ T^^ , 

t 
{J^^f){t)^Jf{v)dv, 0<t<2T. 



The integration commutes with the response operator: 

this easily follows from the representation (33). 

The formula, which expresses the connecting operator via the response 
operator, is 

C^ = -i (5^)*i?2Tj2T^T. (37) 

its derivation see in Appendix I. Furthermore, substituting (33) to (37) one 
can represent C"^ through the reply function: 

T 

{C^f){t) ^ ^fit) + Jc^{t,s)f{s)ds, 0<t<T, (38) 



with the kernel 

2T-t-s 

c^{t,s) :== - / r{-q)dr] 0<s,t<T. 

\t-s\ 

To derive (38) from (37) is a simple exercise. 
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The foUowing fact wih be used later in solving the inverse problem. As- 
sume that the external observer investigates the system ol^ via its input i-^ 
output correspondence. Such an observer operates at the endpoint a; = 
of the string; he can apply controls / and measure u^ \x=ii but, however, 
cannot see the waves u^ themselves on the string ^ . As result of such mea- 
surements, the observer is provided with the reply function r |(o.2T)- If so, 
the observer can determine the operator C^ by (38) and then, for any given 
controls f,g& T'^ ., find the product of the waves (u-^( • , T), u^( • , T))^^^ 
by (34), even though the waves themselves are invisible! As we shall see, 
such an option enables one to make the waves visible. 

In conclusion, we present the main objects of the system a^ on the 
diagram: 



external 
observer 



x = Q 



inner 
space 




outer 
space 



Figure 6. System a^ 



3.2 Controllability 

The question, which we consider here, can be posed as follows: Can one 
manage the shape of the wave on a string? More precisely: Is it possible 
to drive the system a^ from the initial zero state (see (2)) to a given final 
state u-f{-,T) = yhy means of the proper choice of the boundary control 
/? This sort of problems is a subject of the boundary control theory, which 
is a highly developed branch of mathematical physics (see, e.g., [26], [29]). 
Here the affirmative answer to the posed question is provided. 



that is a rule of game in inverse problems! 
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There is an evident necessary condition for the above-mentioned prob- 
lem to be solvable. In view of the property (17), the function y has to be 
supported in the interval O'^^ filled with waves at the final moment. There- 
fore, the relevant setup is the following: given function y G T-C^^ , to find 
control f £ J-^ such that 

uf{x,T) = y{x), xen"'^ (39) 

holds. It is what is called a boundary control problem (BCP). The BCP is 
obviously equivalent to the equation 

W^f = y. (40) 

In the case of the unperturbed system, by (7) the BCP takes the form 
of the equation 

/(r-^) =y{x)., xen^^, 

which has the evident solution 

fit) ^y{ciT-t)), 0<t<T. (41) 

In the perturbed case, changing the variables in (28) we get 
t 
f{t) + j k^{t,s)f[s)ds^y{c{T-t)) , 0<t<T (42) 



with k'^it, s) := w {c{T — i), c{T — s)) that is a second-kind Volterra equa- 
tion w.r.t. /. As is well known (see, e.g., [30]), such an equation (and hence 
the BCP) is uniquely solvable in T^ for any r.h.s. In operator terms, this 
means that the control operator is boundedly invertible, its inverse {W^) 
being defined onto Ti"^-^. 

This result can be also interpreted as follows. The set of waves 

U^ := K(-,r)|/e.F^} - W'^T'^ 

is called reachable (at the moment t — T). The solvability of the BCP is 
equivalent to the relation 

U^ ^ n"^ , T > , (43) 

which shows that our system can be steered from zero state for any state by 
proper choice of the boundary control. In control theory, such a property of 
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a dynamical system is referred to as a controllability. Taking into account 
the finitcness of the wave propagation speed, we specify it as a local boundary 
controllability of the system a^ . 

For inverse problems, controllability is an affirmative and helpful prop- 
erty. A very general principle of system theory claims that the richer the 
set of states, which the observer can create in the system by means of the 
given reserve of controls, the richer information about the system, which 
the observer can extract from external measurements [22]. As we shall see, 
the BC-mcthod follows and realizes this principle. 

3.3 Wave basis 

Here wc make use of controllability of the system a^ for storing up an 
efficient instrument, which will be used for solving the inverse problem. 

Fix a positive ^ < T. Since T > in (43) is arbitrary, one can put t ^ T 
and r = r — ^ in (18) and rewrite the controllability relation in the form 

WT-pT,i ^q-ici ^ < ^ < T . (44) 

Let us choose a basis of controls {/ }°li in the subspace T'^'^. Since 
W^ is a boundedly invertible operator, the relation (44) yields that the 

corresponding waves {u-'j ( • , T)}°^]^ constitute a basis of the subspace T-U^. 
For the needs of the inverse problem, it is convenient to make the latter 
basis orthonormalized. To this end, we recast the basis of controls by the 
Schmidt orthogonalization process w.r.t. the bilinear form {C^ f,g) ^t'- 



~9l ■■- /I - {c^fl , 9i)^^ 9i , 9i ■■- ^ 



C^~9l9i 

9l 
C^~9l , gl 



T"^ 



^T 



fe-1 _f 



^i-/l-E(^"i^.4) 



j^i 
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T-r 



x\[c^~9\r9i,^^ 

(45) 

and get a new system of controls {gf }^i, which is also a basis in JF^'? and 
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satisfies 

{c^9l9f)^^=S,i k,l = 1,2,... (46) 

by construction. The corresponding waves {uj.}'^^ , uj, :— w^fc(-,r) = 
W^gl form a basis in the subspace Ti.'^^, the basis turning out to be or- 
thonormal. Indeed, we have 

(see (46)) = 4z k,l^l,2,... . 
We say {u|}^i to be a wave basis in Ti'^^. 

3.4 Truncation 

As in the previous section, we keep ^ € (0, T) fixed. In the inner space 
TC^^ introduce the operation P'^^ 



y{x) , X e Vl"^ 

, xen-'^X Vt"^ 



{P''y){^)-, . _o.T 



that truncates functions onto the subintcrval J7^^ C VL'^'^ . As is easy to 
see, P'^^ is the orthogonal projection in TY"^-^ onto the subspace TC^^ and, 
expanding over the wave basis, we can represent the truncated function in 
the form of the Fourier series: 



oo 



P=S = E 2^'"9«e."i- (47) 



«' 



By the controllabihty (43), the function y £ Ti"^-^ can be regarded as 

a wave produced by a control / = (T^^) V- By the same reason, the 
truncated function P"^^ y g TC^^ is also a wave produced by a control /^ := 
(W'^) P'^^J/; the control belonging to the subspace .F'^'^ by (44). Thus, 
we have a correspondence / f— > /^, which is realized by an operator 

V^ -.^ {W'^y^ P"^ W'^ (48) 

acting in the outer space JF"^. In other words, the truncation y i— > P'^^y in 
the inner space induces a truncation-like operation f >-^ V^f in the outer 
space (see Fig 7). 
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c^ 

Figure 7. Truncation 



*-x 



An important fact is that the latter operation can be represented in the 
form of the expansion over the system {.g^.j^x' namely, the relation 



7'«/ = 5](c^/,5^j_5l- 



fc=i 






(49) 



is valid. Indeed, substituting y — W'^ f and u| — W^gl in (47), we have 

oo 

P^iW^f = J2i^^f^^^9i)^^^W^gi = (see (34)) 



fe=i 



H" 



w- 






fc=i 



9k]^^9k- 



Applying iW^) and taking into account (48), we arrive at (49). 



3.5 Amplitude formula 

Now, let us derive a relation, which represents the values of waves 
through the operator V^ . Let a control / e JF^ be continuous on [0,T]; by 
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(28) the corresponding wave u^ {■ ,T) is continuous on O^-'". The following 
equality holds and will play a key role in the inverse problem: 



Jic^.T) = (T'?/) (T - e + 0) = (sec (49)) 



E(^"/'5i.).,4W 



Lfc=l 



.F^ 



o<^<r. 



(50) 



t=T-5+0 



Indeed, the truncated wave (^P'^u-^{- ,r)) (x) vanishes for x > c^, is con- 
tinuous in rj"^^, and has a jump 



(p^«u/(.,r))(x) 



x=c(,+0 



x=c^-a 



uf{c^,T) ^ -uf{c^,T). (51) 



In the mean time, the truncated wave P'^^iJ ( • , T) is also a wave produced 
by the "truncated" control: P^^u^ {■ ,T) — u^ ^{-.T). Hence, one can 
apply the jump relation (19): replacing there C by T — ^ and taking t = T 
in the l.h.s., we get 



,^^/i 



x,T) 



x=c^+0 



x=c^-0 



V^fis) 



s=T-C+0 



s=T-C-0 



7'«/(r-e + o) (52) 



because V^f belongs to the delayed subspace JF^-^ and, hence, vanishes for 
< t < T — 5. Thereafter, comparing (51) with (52), we easily arrive at 
(50). 

We say the representation (50) to be the amplitude formula (AF). The 
reason is that it represents the wave through the jump amplitudes of the 
control, which appear as one projects the control on the subspaces !F'^'^ 
by the operators V^ . The background of the AF is geometrical optics; its 
various versions play the role of key tool of the BC-method [7], [12]. 



3.6 Special BCP 

In this section we deal with the problem (39) with a special r.h.s. y. 
Consider the Cauchy problem 



p" + q{x)p = 0, x>0 

P U=0= Oi, p \x=Q^ P 



(53) 
(54) 



(a and /? are the constants) . By standard ODE theory, such a problem has 
a unique smooth solution p = pafi{x). In what follows we assume that a, /3 
are fixed. Therefore we omit these subscripts and write just p{x). 
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Consider the special BCP: find a control / e T'^ satisfying 

u^(-,r) = p inVl"'^ . (55) 

By (43), sueh a problem has a unique solution /-^ e J-'^ and, applying the 
AF (50) for f :^ f^ we get 



p(cO = 



Lfc=l 



, o<e<r. 



(56) 



t=T-{+0 



The remarkable fact is that, in this ease, the coefBcients of the series can 
be found explicitly. 

To find the coefficients, let us take a smooth control g £ T^ provided 
g(0) = ff'(O) = 5"(0) = 0. The corresponding wave u^ satisfies (l)-(3) in 
the classic sense; therefore it vanishes at its forward front together with the 
derivatives and we have 



u3{cT,T) = ul{cT,T) = 0. 
This enables one to justify the following calculations: 

{C^f,9)^T = (see(34)) = (uf" {■ ,T),u3{- ,T)) ^^ 

cT 

(see (55)) = p{x)u3{x,T) pdx = 



(57) 



■HcT 



(by the zero Cauehy data (2) for the wave t 



cT 



p{x) 



(T — t) u^^{x, t) dt 



pdx 



T cT 

dt{T-t) [ {p{x),puUx,t)) dx =(see(l)) 




cT 



.*(T-<,y,WK.K„-,W.»(..<)l<i. = («(57)) 





(58) 
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T 

= f dt{T~t){-p{0)u3^{0,t)+p'{0)u^0,t) + 


cT 

[p"{x) — q{x)p{x)] u^{x, t)dx} = 

T 

(see (53), (54)) = /(T - t) [-a (R^g) {t) + (3g{t)] dt = 



where ^c^ G JF"'", >f(t) -.^ T — t. Thus, we have got the equahty 

{C^f,9)^T = {- {R^y >^^« + ^^P--9)^^ (59) 

for classical g's. Since such controls constitute a dense set in T'^ , the evident 
limit passage extends (59) to arbitrary g e T^ . 

Taking in (59) / = gj, and substituting in (56), we arrive at the following 
important representation: 



picO = 



J2{-{R^Y.^a + .^(3,gi) giit) 

.k=l 



, 0<S_<T. (60) 

t=T-$,+0 



The significance of this formula is that it represents the function p, which 
is an object of the inner space, through the objects of the outer space. We 
shall see that it is the fact, which enables the external observer to use (60) 
for solving the inverse problem. 

3.7 Gelfand-Levitan-Krein equations 

There exists one more way to characterize the solution of the special BCP 
(55) in intrinsic terms of the outer space: the control f ^ f^ coincides with 
a unique solution of the equation 

C'^f ^ -a {r'^Y k^ + iiyt^ . (61) 
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In more detail, in accordance with (38) and (32), this equation takes the 
form 



V~pfit) 



2T~t-s 



r{i]) dr] 



\t-s\ 



a 



Vp + J ^(s -t)iT- s) ds 
t 



f{s)ds 



+ /3(r-i), o<t<r. (62) 



The relation (61) follows from (59) just by arbitrariness of g. Applying the 
jump relation (20) for ^ = T, we obtain 



/^(+0) = u^^ {cT - 0, T) = p{cT) 



(63) 



For solving the inverse problem, not a single equation (61) will be enlisted 
but a family of such equations corresponding to the family of the special 
BCP's 

m/(.,^)=P inf7'=«, (0<e<T). (64) 

Just replacing T by ^ in the previous considerations, we conclude that the 
solution / ~ /^ e JF^ of (64) coincides with a unique solution of the equation 



C^f = -a{R^y >c^ + /3k^ 



(65) 



or, equivalently. 



v~pm 



2^-t-s 



r{rf) dr] 



\t-s\ 



-y/p + r{s - t) iC - s) ds 



fis)ds 



+ /3(^-t), 0<i<e. (66) 



Also, in perfect analogy to (63), the relations 

/«(+o)=p(cO, o<e<r 



(67) 



are valid. 

For each ^, the equation (65) is a 2-order Frcdholm integral equation. 
Since W^ is a boundedly invertible operator (see sec 3.2), the same holds for 
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the connecting operator C^ — {W^^ W^, which turns out to be a positive 
definite isomorphism in T^ . Hence, each equation is uniquely and stably 
solvable. 

The family of the equations (66) plays the key role in solving the in- 
verse problem. We refer to them as the Gclfand-Levitan-Krein equations 
(GLK). The reason is that the classical Gelfand-Levitan equations, which 
for the first time were derived for solving the spectral inverse problem for 
the Sturm-Liouville operator (see [19]), can be obtained from (66) by dif- 
ferentiation and simple change of variables. In the same way, dealing with 
the special BCP for the inhomogeneous string with a variable density ^, 
one can derive the relevant analog of (66) and then reduce it to the classi- 
cal M.Krein equation [23]-[25], as is done in pioneer works by Sondhi and 
Gopinath [20], [21] and Blagovestchenskii [17], [18] (see [8] for detail). So, 
in the framework of the Boundary Control method, the classical inverse 
problem equations are interpreted as the equations in the outer space of the 
system a^ , which provide the solutions of the relevant special BCP's. To the 
best of our knowledge, for the first time, such a look at the Krein equation 
was proposed in [20] . It is also relevant in multidimensional problems [3] . 

4 Inverse Problem 

4.1 Statement 

The setup of the dynamical inverse problem is motivated by the locality 
principle or, more exactly, by the local character of dependence of the re- 
sponse operator on the potential. Recall that the operator i?^-^ associated 
with the extended problem 1 is determined by q |qct (see the remark be- 
low (33)). This property is of transparent physical meaning. Namely, the 
response of the system a^ (i.e., the force u^(0, t) measured at the endpoint 
of the string) on the action of a control / is formed by the waves, which are 
reflected from inhomogeneities of the string and return back to the endpoint 
a; = (see the arrows going to the left on Fig 8). Since the wave propagation 
speed is equal to c, the waves reflected from the depths x > cT return to the 
endpoint later than t = 2T; they are not recorded by the external observer 
measuring the values w|(0,i) for times < i < 2T. Therefore, the operator 
R^^ , which corresponds to these measurements, contains certain informa- 
tion on q IfjcT but "knows nothing" about q\x>cT- Taking into account this 
fact, the relevant statement of the dynamical inverse problem has to be as 
follows: given operator R^^, recover the potential q Iqct. 



Such a string is described by the equation p{x)uu 
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Figure 8. Incident and reflected waves 



Note in addition that this statement agrees closely with very general 
principles of the system theory [22]. By one of them, the inputf^output 
map of a linear system (here the response operator R^"^) determines not 
the whole system but its controllable part (here the interval fi'^-^) only. 



4.2 Solving inverse problem 

The potential q Jjjct can be recovered by means of the following proce- 
dure: 

Step 1 Determine the constant ^/p and the reply function r |(o,2T) from 
(33). Find the operator C^ by (38). Compose (i?^)* by (32). 

Step 2 Fix ^ G (0,r]. In the subspace J^^'« C T^ , construct a C'^- 
orthogonal basis of controls {sfj^i (see (45)). 

Step 3 Choose two constants a, (3 provided a^ + /?^ 7^ and find the 
value p(c^) by the use of the relation (60) . 

Step 4 Varying ^ £ (0,T], recover the function p in $7"^-^. 
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Step 5 Determine the potential from the equation (53); 

P"{x) 
p{x) 



q{x) = ^^ , a; e r!^^ . (68) 



Surely, using (68) one has to take care of the case p{x) = 0. However, as 
is well known, on any finite interval $7'^"^ there may be only a finite number 
of zeros of p. Therefore, if q is determined in other points it can be extended 
to the zeros by continuity. Another option to remove the zero of p at a given 
X = xqIS to choose another a, (3. 

The inverse problem is solved. 

One more way is to make use of the GLK-equations. One can compose 
the family of the equations (66), find their solutions f — f^, determine 
the values p{c£,) by (67), and hence recover the function p in fl'^'^ . There- 
after, the potential is determined by (68). In [15], such a scheme is real- 
ized numerically for the string of variable density governed by the equation 

pUtt - Uxx = 0. 

So, the dynamical inverse problem stated in sec 4.1 is solved. In such 
a clear and natural form it was set up by A.S.Blagovestchcnskii, who elab- 
orated the so-called local approach solving the problem for the inhomoge- 
neous string and providing the characteristic conditions for its solvability 
[17]. Independently and almost simultaneously, in physical literature there 
was proposed another approach belonging to B.Gopinath and M.M.Sondhi 
[20], [21], which also solves the inverse problem in the above-mentioned 
setup, i.e., preserves the locality of determination of the string parameters. 
Both of the approaches derive M.Krein's equation and apply it for solving 
the inverse problem, the derivation being based upon a purely local (hyper- 
bolic) technique that allows not to invoke irrelevant spectral devices (like the 
Fourier transform w.r.t. time, etc). A peculiarity of the Gopinath-Sondhi 
approach is its straightforward relation to the BCP. 

4.3 Visualization of viraves 

Here we realize the option mentioned at the end of sec 3.1 and show how 
one can make the waves on the string to be visible for the external observer. 
The proposed procedure is a core of the BC-method: it is relevant to a wide 
class of inverse problems including multidimensional ones [4] . [7] . 

Let the observer choose a smooth control / e JF^, so that the corre- 
sponding (invisible) wave u^ {■ ,T) is also smooth in fl'^"^ . 
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Step 1 Fix ^ € (0,T] and construct the system of controls {5^}^! C 
J^'^'^ that produces the (invisible) wave basis by (45). 

Step 2 Determine the operator V^ by (49) and then find the value 
M/(c$,r)by(50). 

Step 3 Varying ^ e (0, T] and repeating the same trick, recover the 
wave u^{x,T) in ff^"^ . 

So, we succeeded in visualizing the wave u-^( • , T)! Moreover, replacing 
^ by i in the delay relation (29), we can recover the wave process in a whole: 



u^(cf,<) = (7'«D^'*/)(T-^ + 0), 0<C,<<T, (69) 



what is much more than just to recover q Iqct. Indeed, as soon as the 
evolution of the wave is seen, the potential can be determined, e.g., from 
the wave equation (1). 



By the way, the representation (69) is quite available for constructing 
numerical algorithms. The only thing, which is needed for the use of (60), 
is to prepare the system of controls {gl}- To this end, one can take a rich 
enough initial system of controls {/|}^i C JF^'^, for instance. 



^^ ' 1 " for other te[0,T] ^ ' 



and then apply the Schmidt process (45). As is well known, such an orthogo- 
nalization is equivalent to inverting the Gram matrix {(C*"^/,^, /,- ):F'^}f'v=l■ 
Since the connecting operator is boundedly invertible in JF^, this matrix 
turns out to be well posed uniformly w.r.t. N that provides the process to 
be stable. However, to recover the potential by the use of (68) one needs to 
invoke the numerical differentiation that complicates the calculations but 
can be realized by standard regularizing procedures. Regarding the use 
of the GLK-equations (66), note that, from the computational viewpoint, 
to solve them and to invert the Gram matrix of the large size is, roughly 
speaking, one and the same [15]. 
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APPENDIX 

I. Structure of fundamental solution The sequence of functions and 
distributions {0^t)}'j^_^, -oo < t < oo, 



f{t) 



0, ^<0 



0^{t)^ / e^-^{s)ds.. 



1, ^>0 

i = ... ,-2,-1,0,1,2,... (71) 

is said to be a smoothness scale: the bigger j, the smoother 6^ and 



«'-«) = ^w. 



'OO < t < oo 



holds. In particular. 



P 



= 5"(t), 9-^ = S'{t), 9-'^S{t), ... , 0-'" = -e°(t) 



Note that all elements of the scale vanish as ^ < 0. 



)"'= 8'(s) 



:6(S) 



e°=y2[i+sigii(s)] 



9' = se°(s) 



e^=fV(s) 



Figure 9. Smoothness scale 



Let us look for the fundamental solution in the form of a formal series 

CO 

u\x,t) = Yl «'(^) ^' (* - -) (72) 



J=-l 



with unknown functions aj , which can be referred to as the Taylor expansion 
of u^ near (from the left of) its forward front t ~ -. The reason to begin 
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the series with j = —1 is that in the unperturbed case one has u^{x,t) = 
S (t — ^) = 9^^ {t — ^). Differentiating and combining the similar terms, 
we easily get 



2 tt 

oo 

E 



^tt - u^x + 9" 



■0-1(2;) 



- aj+i(a;) - a'^i^) + Qix)ajix) 



e^t~- 



By the independence of the different order singularities, we get the recurrent 
system of the ODEs 



-«-i =0, 



I'j - a"-i + qaj-i = , j = , 1 , 



which is often called the transport equations and can be integrated one by 
one. In the mean time, the condition u''(0, t) = 5{t), i > implies the initial 
conditions 

a'_i(0) = 1, a^(0) =0, j = 0,1, ... . 

The integration yields 



a'_i = 1 , a'^lx) 



q{s) ds , 



Summarizing, we arrive at the well-known representation 



\x,t)^s(t-^) 



q{s) ds 



'°0-D 



that describes the behavior of the fundamental solution near its forward 
front i = -. 

c 

The above-presented derivation is often referred to as a dynamical ver- 
sion of the WKB-method, whereas (72) is called the progressive wave ex- 
pansion ^ . The meaning of such a representation can be commented on as 
follows. Even though the shape of a wave can be complicated, the evolu- 
tion of its singularities ((5-terms, jumps, etc) is ruled by rather simple lows 



The terms ray 'method and ray expansion are also in use. 
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(geometrical optics formulas). This fact is also valid for a wide class of 
hyperbolic problems including multidimensional ones. 

II. Proof of (37), (38) Choose smooth controls /, 5 G T^^ such that 
/ vanishes near t — Q and t ^ T, whereas g vanishes near t = 0. Denote 
/_ :— S'^ f £ J-"^^] let u^ be the solution of (l)-(3) and u^^ the solution 
of (21)-(23) such that u\x=o = /-. Note that both of these solutions are 
classical. Also, by the choice of g and the property (17), for every t > the 
solution u^[x, ,t) vanishes in a neighborhood of a; = ct. 

Introduce Blagovcstchenscii's function 



b{s,t) :— u' {x,s)u^{x,t) pdx, {s,t) £ B 



■2T 



in the triangular B^^ := {{s,t)\ < t < T, t < s <2T -t}. Function b 
satisfies the relations 

btt{s,t) - bss{s,t) = 

u*^ {x, s) uff{x, t) — uli. (x, s) u^{x, t)] pdx ~ (see (1)) = 



u^-{x,s) {ul^{x,t)-q{x)u^{x,t)) - 

{uicx {x, s) — q{x)u-'~' (a;, s)) u^{x, t)] dx = 

u^- {x, s) ul^ - ul- (x, s) u^x, t)] dx 





- [uf- (0, s) <(0, t) - ut {x, s) u^O, t)] = 

- /_(s) (R^g) it) + {R'^f^) {s)9{t) =: F{s,t) 

(in course of integration by parts, the terms at a; = cx3 vanish by the above 
mentioned properties of u^). Hence, with regard to the zero Cauchy data 
(2) for u^ , we arrive at the problem 

btt-bss^F in B^^ 
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Applying the D'Alcmbcrt formula, we have 

t S + t-TJ 



h{s,t)^\jdri J Fi^,rj)dt is,t)eB'^. 



s-t+ri 

Putting here s ~ t — T^ we get 
biT,T)^ 

T 2T-r] 



1 



dv / [-f-iO {R'g) iv) + {R^^I-) {Oairi)] dv = 



n 

T 



2T-r] 



dii{R^g){7l) / /-(e)rf? + 



2T-?7 



/ dv airi) 2 / (i?'^/-) (0 d^~J {R'^^n (C) d^ 



L 

By oddness of /_ , we have 

2T-7) 



j /_(e)de = o, o<?7<r. 



Going on the calculation and taking into account the representation 

■ 2T-?) 



j {R^^J^){Odi-j{R^^J^){Od^ 
. 



(see (35)) 



T\* r>2T j2T qT _ 



one obtains 



R^' J''Sif){'q), 0<rj<T 



biT,T) - {-;^{S^) R^'^J^^S^f,g 



JP-T 
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On the other hand, the definition of b and the evident equahty u-^^ {' ^T) = 
u-^ {■ ,T) imply 



b{T, T)^ uf- (x, T) u3{x, T)pdx= (u^(-, T), u9(-, T))^,^ 



see 



(34)) = (C^/,5) 



j^r 



Compare two obtained expressions for b{T,T). Taking into account the 
arbitrariness of the chosen /, g and density of such controls in J-'^ , we justify 
the representation (37). Substituting (33) in it. one can easily derive (38). 

An important opportunity to express the inner product of waves via the 
response operator was found out by A.S. Blagovestchcnskii. 

PART II: 2-CHANNEL DYNAMICAL SYSTEM 
(BEAM) 

5 Forward problem 

5.1 Statement 

Here we deal with an initial boundary value problem (problem 1) for the 
system 

putt - u^x + Au^. + Bu = 0, x>0, 0<t<T (73) 

u|t=o = wt|t=o = 0, x>0 (74) 

uU=o = /, 0<t<T, (75) 

where 

/pi 0\ faiiix) ai2{x)\ fbii{x) bi2{x) 

P \Q P2)' \a2i{x) a22{x))''^ ^i(x) 622(2:) 

are 2x2- matrices with smooth (continuously diffcrentiable) entries, whereas 

pi 2 are the constants provided Q < pi < P2 ^ \ f — f lA] is a boundary 

\hyilj 

\ui{x,t) 



control; u = u^ [x,t) = [ ^ '^:| ) is a solution (wave). 



The case < pi = p2 will be mentioned later in Comments. 
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Also wc impose the following self-adjointness condition on the coeffi- 
cients: 

HA 

A^ix) = -A{x), —{x)=Bix)-B^ix), x>0, (76) 

ax 

where (■■•)' i'' the matrix conjugation; in more detail, 

M^)=[at) '"o"^^)' ^2i(x)-6i2(x) = a'(x), x>0. 

Such conditions come from physics: they provide a relevant energy conser- 
vation low on the beam ^°. 

In parallel we consider the unperturbed problem 1 



pUtt - Uxx = 0, 


x>0, <t<T 


(77) 


u\t=o = Ut\t=o = 0, 


x>0 


(78) 


u\x=o = /, 


0<t<T, 


(79) 



and denote by il^ its solution, which can be found in explicit form: 

uf{^,t) = (j^fjltj) (80) 

(recall the Convention of sec 1.2!), where q :— -i=-, c\ > C2. Such a solu- 
tion describes a wave process in two-channel system, in which two different 

wave modes ^ Vi ) ^^'-^ iff ^ ) Pi'op&g^'te in the first (fast) and 

\ J \U2[X,t)J 

second {slow) channels with the velocities ci and C2 respectively, the modes 
propagating independently (not interacting with each other). 



In contrast to this, in the perturbed system (73)- (75) with non-diagonal 
A and/or B, the modes do interact and we shall see that such an interaction 
provides interesting effects. A classical example of a 2-channel system with 
interacting modes is a Timoshcnko beam in elasticity theory. One more ex- 
ample comes from electrical engineering: a twin-core cable with interacting 
cores. 



However, all results of general character (sec 5.2 - 5.5) are valid for arbitrary smooth 
coefficients A, B. 
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5.2 Integral equation and generalized solutions 

The problem 1 can be reduced to a system of integral equations by 
means of the same trick that was used in sec 2.2. Namely, denoting Qu := 
Aux + Bu, representing u^ = u^ + w, and substituting to (73)-(75) with a 



new unknown vector-function 



wi{x,t) 



we get the problem 



1/ 



a; > 0, < f <r 

x>0 

0<t<T. 



w|t=o = ut\t=o = 0, 

u\x=o = f, 

Applying the D'Alembert formula by components, we arrive at the equation 

w + Mw = ~Mu^ (81) 

with the matrix integral operator A/, which acts by the rule 



{Mw){x,t) 



2-fe// {Qw)^{tTl)dS.d^\^ 



x>Q. 



and Kci {x, t) are the trapeziums bounded by the characteristic lines t± — 
const . 




ST c,T 

Figure 10. Domains Kf.^{x.t) 



i 



The equation (81) is a second kind Volterra- type equation which can be 
analyzed by iteration method. The analysis provides the following results 
(see [16], [31] for detail): 
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• If the control / is a twice continuously diffcrentiable (vector-) function 
provided /(O) = /'(O) = /"(O) = 0, then one can check that the 
solution w^ of (81) is also twice diffcrentiable w.r.t. x,i, whereas the 
function u^ = u^ -\- w^ turns out to be a unique classical solution of 
the problem (73)- (75). 

• As is easy to see, for f E L2 ((0,r);IR.^)^^, the r.h.s. of the equa- 
tion (81) is a continuous function of the variables x,t, the function 
vanishing for t < —. Simple analysis shows that the equation is 
uniquely solvable in this class of functions. In this case, the function 
u^ :— u^ + w^ is regarded as a generalized solution of the problem 1. 
So, just by the accepted definition such a solution does always exist 
and is unique. 

5.3 Fundamental solution 

For the problem 1, the relevant analog of the scalar fundamental solution 
u is the solution of the matrix problem 

pUtt " U,^^ + AU,^ +BU ^0, x>0, 0<t<T (82) 

U\t^o = Utlt^o ^ 0, x>0 (83) 

C/.=o = <5(i) ("J JV 0<t<T, (84) 

where 6 is the Dirac delta-function, U = U{x,t) is a 2 x 2 - matrix-function. 
In the unperturbed case A = B = holds and one has 

Uix,t)^ ('^^*0"^ Sit^f))- (^^) 

In the perturbed case, representing 

U ^ U + W, (86) 

one can reduce the problem to a relevant version of the matrix integral 
equation (81) for the regular part W and analyze it by the iteration method. 
Omitting the details, let us present some results. 

1. The solution W{x, t) is a piece-wise smooth matrix-function, whereas 
possible breaks of smoothness (jumps and/or jumps of derivatives) can occur 
on the characteristic lines t — ^ only. The solution is supported above the 
fast characteristic: 



W{x,t) 



(87) 



t<^ 



It is the class of vector-functions satisfying J„ (/i(t)) -I- (/2(t)) dt < 00. 
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and satisfies 

W{0,t) =0, t>0. (88) 

2. To describe a structure of the fundamental solution let us note that 
the columns of the matrix U coincide with the solutions of the problem 1 

corresponding to the controls p. ] ^-nd „ , , 



U{x,t) 



( ^ \ 

u} ' {x,t) u}- ' {x,t) 
'6\ /O' 



\U2 (x, t) 1*2 {x, t)/ 



(see (85)) 



[ W2l{x,t) S{t-f)+W22{x,t))- ^^^^ 



By (87), the fundamental solution vanishes under the fast characteristic 
i = — ; typical behavior of its components is illustrated on Fig 11 and 
commented on below: 

• Fig 11a The control I _ j injects the singularity into the first (fast) 

channel. The singularity of the form S{t — — ) propagates in this 
channel with the velocity ci and interacts with inhomogcneities of the 
beam. As result of interaction, a tail appears. 

• Fig lib Owing to interaction between channels, the fast component 

u-^ ' induces a wave into the second (slow) channel, the wave prop- 
agating with "anomalous" fast velocity c\. The part of the compo- 

;») 

nent Uj , which appears as result of the inter-channel connection, 
is called a precursor ^^ (shaded on Fig lib) Also note that in the 
case oi A = the second component turns out to be smoother: it is 
continuous (has no jumps) but may have jumps of derivatives at the 
characteristics t = —. 

Fig lie The control I „ ] injects the singularity into the second (slow) 

channel. The singularity S{t— —) propagates in this channel with the 
velocity C2 . Inhomogcneities of the beam yield appearance of the tail 
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s(t-f) 



^\j\r~^ 




^^{oo,t 




Cjt X 



♦u®(^,t) 







s(t-f) 



^\j\r-^ 



c.t 



Figure 11. Fundamental matrix solution 



Ci oc 



behind the leading singularity. Since the channels do interact, the slow 
mode induces a wave process in the fast channel, which propagates 
with the velocity c\ and, hence, leaves behind the slow singularity. In 
the mean time, such a process excites the waves in the slow channel, 
what implies the appearance of the precursor (shaded) propagating 
with anomalous fast velocity. 

(°) 

Fig lid The fast component of the wave uV / appears as result of 
the inter-channel connection. 

In the case A = the interaction between channels is weaker and 
the first component turns out to be smoother: it is continuous (has 
no jumps) but may have jumps of derivatives at the characteristics 



It is worthwhile to note that the leading singular part of the fundamen- 
tal solution is the same in the perturbed and unperturbed case (sec (89)). 
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Also some jumps (as well as jumps of derivatives) shown on Fig 11, can 
occasionally vanish that depends on the behavior of the coefficients A, B. 



As we see, the interaction between channels renders the picture of waves 
on the beam much more complicated than one on the string. The technique 
that provides the picture shown on Fig 11 and computes the jumps of the 
regular part of the fundamental solution is in principle the same as the 
technique developed for the scalar case in Appendix I. Namely, by analogy 
to (72) one can seek for U in the form of the expansion by smoothness 



Uix,t) 

To 



-) 







oo r 

E 



a,ix)e^(^t-^^ 



+ b,{x)eUt-- 



C2 



(90) 



with (unknown) matrices aj,bj, then substitute (90) in (82) ~ (84) and 
determine the matrices by means of the separation of singularities. However, 
since the calculations are rather cumbersome, we omit them and recommend 
the reader to fill this gap as an exercise. As example, the first coefficients 
of the series look as follows: 



aa{x) 



bo{x) 



-^ f ^^a2(s)+6n(s) ds -^^a{0) 



P2-P1 



a{x) 



P2-P1 "- ' ^Jj2 J 



/P2 



a(x) 



P2-P1 



ds 



Recall that a{x) is the matrix clement of the coefficient A{x): see (76). 



5.4 Properties of waves 

Since the problem 1 is linear and the coefficients A, B are time-independent, 
the superposition principle does hold: by the Duhamel formula, for a control 
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f e L2 ((0,r);M2) one has 

u^x,t) = U{x,t)*f{t) = (see (89)) = 
t 

f^f^Zl]) + J ^(^'^ - ^) fi^)ds = (see (87)) = 



Such a representation easily implies the following properties of the solution 
uf : 

• The relation 

uf \t<f- =0, a;> 0, t > 0, (92) 

which is interpreted as the finiteness of the wave propagation speed 
on the beam, holds. 

• For / e L2 ((0,T);R2) and r G [0,T], define a delayed control /^ S 
L2((0,r);M2)by 

/r(t) := /(t-r), 0<t<T, 

where r is the value of delay (recall the Convention!). Since the coef- 
ficients A, B do not depend on t, the delay of the control implies the 
same delay of the wave: the relation 

u^^{x,t) = iJ{x,t-T) (93) 

is valid. 

• Let a control / be a piece- wise continuous vector-function, which has a 

jump at i = ^. By the properties of the regular part of the fundamental 

solution W, the integral term in (91) depends on x, t continuously. 

/fit— —\\ 
Therefore, u^(x,t) and the term [ „ ,, '^i n I have one and the same 

jumps at the characteristic lines t ~ — + £,, the values of the jumps 
being connected (by components) as 






u{{x,t) 
So, the jumps propagate along the fast and slow characteristics see Fig 12a. 



s=i+o 

t>0, i^l,2. (94) 

s=^-0 
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Figure 12. Propagation of jumps 



For what follows it is convenient to write (94) in the following equivalent 
form. Assume that at the final moment t = T the wave u-^ {■ ,T) turns 
out to be a piece-wise continuous, with a jump at the point x ~ C2^. As 
is easily seen from (91), such a case is possible only if the control / has a 
jump in both components, the values of jumps being connected through the 
relations 



m((x,T) 



X=C2^ + Q 



X — C2^ — 



/l(^) 



1 


^2 (a^, T) 


X = C2^+Q 


- f2{s) 


s=T-(,-0 


T-§fC+0 




X=C2i-0 




s=T-C+0 

(95) 



(see Fig 12b). 



5.5 Extended problem 1 and locality 

As well as in the case of a string, the finiteness of the wave propagation 
speed leads to the local dependence of the solution of the problem (73)-(75) 
on the coefficients A, B. Namely, by the arguments quite analogous to ones 
used in sec 2.5, the following extension of the problem 1 turns out to be a 
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well-posed problem: 

putt - Uxx + Aux + Bu^Q, 
u\t<^ = 

u\x=o = /, 



(x,i)eA^^ (96) 

(97) 

0<<<2r, (98) 



where A^^ := |(a;,t) | < x < ciT, Q<t <2T ~ ^\ (sec the shaded do- 
main on Fig 13). The solution u^ of the problem (96)-(98) is determined 



2T 



f ' 




f 




^\^ 


\ */o-v/ 


\ // / 


\ Xo / 


\ / 


>c ^ 




2T/ ^-^ 


A/ .^; 




/ .' 




/ y 




/ /^ 




/ ^ 




//'''o 




y 


► 



Figure 13. Extended problem 



by the values of the coefficients A^B |o<x<ciT only (do not depend on the 
behavior of A, B \x>cit)- 

Solving the matrix problem 

pUtt - U^x + AU, + BU^O, {x, t) e A2^ 

U\t<^ = 



u u=o= m 



1 
1 



< t < 2r, 
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we get the fundamental solution of the extended problem. As is easy to see, 
for times t < T it coincides with the above-introduced fundamental solution 
of the problem 1, whereas the Duhamel representation 

u^ix.t) = U{x,t)*f{t) = 

t-f- 

(/^(l:tj)+ / W{x.,t-s) (^^g) ds, (x,t)eA^^ (99) 

holds and extends (91). 

5.6 Matrix reply function 

Choose a smooth control / provided /(O) = 0. Differentiating in (99) 
w.r.t. X and putting x = 0, one can get the representation 

u^(0,t) =[/,(0,t)*/(t) = 
t 

- yrpf{t)+Lo!{t) + j r{t-s)f{s)ds, 0<<<2T, (100) 





pT 0\ ,^,._/ -,T^«(0) 



where 



arc the constant matrices, whereas 

-W-f'"S '"mV o<t<2r 

is a smooth matrix- function ^'^, which is said to be a reply function. It will 
play a central role in the inverse problem for the beam. By the locality prin- 
ciple, the reply function r |o<t<2T is determined by the parts A, B |o<z:<ciT 
of the coefficients. 

We omit the proof of (100) and recommend the reader to fill this gap 
himself. The helpful remark is that to derive (100) one needs to represent 

ui{Q, t) = (see (91)) = C/,(0, t) * f{t) , < t < 2T , 



The case r(+0) ^ is possible, i.e., the r(t) may have a jump at t = only. 
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find Ux{x,t) by difi^crentiating the representation (90) in the sense of dis- 
tributions, put there x = 0, and substitute to this eonvolution. The cal- 
culations are rather cumbersome and must be implemented accurately ^^. 

In addition, the matrix reply function possesses the following important 
property. As can be shown, the self-adjointness condition (76) implies 

r\t) = r{t), t>0. (101) 

To prove this relation one can just apply the trick used in [1], Lemma 2.2, 
p433. 

5.7 Slow waves 

Here we describe a nice physical effect, which is a specific feature of 
two-velocity systems. This effect will play a key role in solving the inverse 
problem. 

Return to the system (73)-(75). li A = and B is diagonal (i.e., &12 = 
&21 = 0), then the problem is decoupled to a pair of the scalar problems for 

f f fu^\ / \ 

the components u\^{x,t) and U2{x,t). The modes -'^ I and y j do not 

interact and propagate in the channels independently with the velocities ci 
and C2 respectively, the second (slow) mode vanishing for t < —. 

An interesting and rather unexpected fact is that the slow waves satis- 
fying 

uf \t<^= (102) 

do ever exist: in spite of the inter-channel connection, a certain mixture of 
the modes can propagate along the beam with the slow velocity C2 ! Consider 
the problem (73)-(75) with T = 00; as is shown in [10], [12], there exists a 
unique smooth function l{t), t > such that the solution u-^ satisfies (102) 

if and only if the components of the control f = { r ] arc linked as 

t 
h{t)^{l*h)[t)^ j l{t~s)f2{s)ds, t>0. (103) 


In particular, taking /2 — 6{t) we get /i = l{t), whereas the corresponding 



wave u V / is supported in the shaded space-time domain on Fig 13a. It is 



In the paper [12], the term ujf{t) is missed by mistake! 
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the wave, which is a mixture (of the fast and slow modes) propagating with 
the velocity C2- 



u7(a;,t) 



T-c'i 





(i-i:)T 




Figure 14. Slow waves 



We say the function / that governs this effect to be a delaying junction. It 
depends on the coefficients locally: for any T > 0, the values I lo<t<(i-3^iT 
are determined by the coefficients A,B |o<rr:<c2T (see Fig 14b). Therefore, 
in the problem (73)-(75) with a finite T > 0, to produce the wave satisfying 
(102) one needs to impose the connection (103) between the control com- 
ponents on the interval 0<t< (1 — ^)T only, whereas /i, /2 l(i_sa)T<t<T 
can be arbitrary: varying these values, one cannot violate (102) just because 
the maximal wave propagation speed is c\. 

Fix a positive ^ < T. Combining the above mentioned properties of slow 
waves with the delay relation (93), we easily conclude that the solution u^ 
satisfies 



,/ 



t<T-i+- 







(104) 
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if and only if the control / satisfies 

/ |t<T_j= and 
t 

h(t)^ j l{t-s)h{s)ds, T-e<i<r-^C- (105) 

J Ci 

Such solutions are supported in the domain shaded of Fig 14c and can be 
regarded as " delayed slow waves" . 

6 Beam as dynamical system 

6.1 System a^ 

Here the problem (73)-(75) is regarded in terms of the control theory. 
We endow it with standard attributes of a dynamical system: spaces and 
operators. The system is denoted by a^ . 

Outer space By angle brackets we denote the standard inner product 

in M?: for a = ( ) i ^ = ( j, I we put (a, b) := ai&i + a2b2- 

The Hilbcrt space of controls (inputs) T^ := L2 ((0,T);M2j ^ith the 
inner product 

T 

{f:9)r- ^ J{fit),g{t))dt 



is said to be an outer space of the system a'^ . It contains an increasing 
family of subspaces 

consisting of the delayed controls (T — ^ is the value of the delay, ^ is an 
action time). Note that J^"^^" = {0} and J^^'^ = T'^ . 

A specific feature of a two-velocity system is that there is one more 
increasing family of subspaces in J-''^ related to slow waves: 



^i 



■T.i 



f-[f^]e:F^'^ 



h{t)= j l{t-3)h{s)ds, T~i<t<T-^f^i 

(106) 
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In accordance with (104) and (105), the wave u^ satisfies (104) if and oniy 
if the control / belongs to the class J^i ■ Also, notice that JF; <Z T '^ . 

Inner space Recall that by Vf^ we denote the interval (0,a) of the 
semi-axis x > 0. The space Ti'^^'^ := L2,p {ft'^'^'^) with the inner product 

ciT 

(u,w)^ciT = / {pu{x),v{x))dx = / [piUi{x)vi{x) + p2U2{x)v2{x)] dx 

is called an inner space of the system a^ . For each / G T'^ , at any moment t 
the wave u-f{-,t)is supported in Q'^^'^ (see (92)) and, hence, can be regarded 
as a time-dependent element of Ti'^^^ . In control theory, u^{- ,t) is referred 
to as a state of the system at the moment t. So, W^^"^ is a space of states. 
The inner space contains two increasing families of subspaces 

H^'^ -.^ {yeH^'^l 2/U>c.«=0} 0<e<T, i = l,2 

of vector-functions supported in the intervals $7^^^ and ft''^^ respectively. 
Notice that H"^^ C H"^^ for every ^ > 0. In accordance with (92), n=i^ 
can be regarded as an interval of the beam filled with waves at the moment 
i = ^, whereas by (102) i7^^^ is the interval filled with slow waves. 

Control operator The input i-^ state correspondence of the system 
a^ is realized by a control operator W'^ : J-'^ — > Ti.'^^'^ , 

W^f ;= M-^(-,T) 

which creates the waves. By (91), it can be represented in the form 

{W^f){x) = (jl^(!^Zly) + J W{x,T-s)f{s)ds, (107) 

which shows that W"^ is a bounded operator. 

Combining (92) with (93), it is easy to see that the condition / \t^x-i= 
implies u-f{ ■ ,T) \x>ci^= 0, i.e., the embedding 

p^TjpT,? ^ -^ci5 ^ < ^ < T (108) 

holds. In the mean time, taking / G JFj and putting t = T in (104) we 
see that u^{- ,T) \x>c2i= 0, i.e., uf{- ,T) is supported in fJ"^^^ and hence 
we have the embedding 

W'^tJ' ^ C W^^ , < e < T (109) 
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Wc shall sec that the character of these relations is different: the first em- 
bedding is strict, whereas the second one is in fact an equality. 

Response operator An input i-^ output correspondence in the system 
a^ is realized by a response operator R^ : T'^ — > T"^ which is defined on 
controls / of the Sobolev class B^ ([0, T];"^^) ^^ provided /(O) = and acts 
by the rule 

(i?^/) (t) := ui{Q,t) = ( jl^^'^M , < t < T. 

Using (100) for times < t < T, we can represent 

t 
(i?^/)(i) = ~^f'it)+Luf{t)+ fr{t-s)f{s)ds, 0<t<T. (110) 



It is easy to check that the adjoint operator (-R^) is well defined on 
controls / € H^ ([0,r];R^) provided f{T) — and acts in the outer space 
by the rule 



T 



^pf{t)+Jf{t)+ r\s-t)f{s)ds, 0<t<T. (Ill) 



With the extended problem 1 (see (96)-(98)) one associates the extended 
response operator R^^ , which acts in the space J-^^ :— L2 ((0,2T);M'^) on 
controls / of the class H^ ([0, 2T]; M^) provided /(O) == by the rule 

{R'^f){t) :=ui{0,t), 0<t<2T, 

where u^ is the solution of (96)-(98). In accordance with (100), the repre- 
sentation 



{R^^f){t) = -^f{t)+ujf{t)+J r{t-s)f{s)ds, 0<t<2T (112) 



is valid. By the locality principle, along with the reply matrix- function r 
the operator R^^ is determined by the part A, B IfjcjT of the cocfhcients. 



'For controls of this class, both components belong to the scalar Sobolev space H^ [0, T]. 
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Connecting Operator Since the control operator W'^ acts from a 
Hilbcrt space J^^ to a Hilbcrt space Ti.'^^^ , its adjoint (Vt^"^) acts from 
TY^i-^ to JF^. By this, an operator 

is well defined. It acts in the outer space J-^ and is said to be a connecting 
operator of the system a^ . Since W"^ is a bomided operator, C"^ is also 
bounded. By the definition, one has 

= {uf{;T),U<>{;T))^^^^ , (113) 

i.e., C"^ connects the metrics of the outer and inner spaces. 

The following remarkable fact occurs: ij the self-adjointness condition 
(76) is fulfilled then the connecting operator can be explicitly and simply 
represented via the response operator, the representation being of the same 
form as in the scalar case: 

where the operators S'^ and J^^ are of the same meaning as in the case 
of the string: see (37). Moreover, the representation through the reply 
function 

T 

{C^f){t)^^pf{t)+ j c^{t,s)f{s)ds, Q<t<T, (115) 



with the matrix kernel 

2T-t-s 

c^{t,s) := - I r{ri)dri 0<s,t<T 

\t-s\ 

also remains in force ^^. To establish (114) one can just repeat the derivation 
of (37) described in Appendix II of Part I, i.e., to introduce the Blagovest- 
chcnskii function 



6(s,i) := j {piJ-{x,s),u^{x,t)) dx, {s,t) e B' 





2T 



Recall that in the self-adjoint case the reply function is symmetric: see (101). 
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and then derive the equation btt — &s 
use of the equahties 



/ 






Aix) 



dx 



B(x) 



— F by integration by parts with the 
u{x), v{x) ) dx = 



[{U:c, V) - (U, V^) - {U, A^V)] 

d 



u{x), 



dx"^ 



A\x)- + {Ai{x)-BHx)) 



dx 



v{x) ) dx 



- K(0),«(0)) + W0),«,(0)) + (w(0), A»(0)«(0))- 



u{x), 



dx^ 



A{x)---Bix) 
dx 



v{x) \ dx 



(116) 



for smooth vector-functions u, v vanishing for large x's. The reader is 
strongly recommended to restore all details of the derivation. 

The significance of the formula (114) for the inverse problem is the same 
as in the scalar case: we shall see that it is the relation, which enables the 
external observer to visualize the waves in the beam through the measure- 
ments at the endpoint x = 0. 

The diagram on Fig 6 is also quite available for the system a^ corre- 
sponding to a beam. 

6.2 Controllability 

The relevant statement of the boundary control problem (BCP) for a 
beam is the same as for a string: given a vector- function y E Ti.'^^'^ to find 
a control f E JF^ such that 

uf{x,T) = y{x), xeQ"'^. (117) 

However we shall see that the character of controllability of a beam differs 
essentially from the one of a string. 

By analogy to the scalar case, introduce a reachable set 



w 



The principal fact is that the latter embedding is now strict: in contrast to 
the property (43), on the beam we have 



W ^ W^' , r>o. 



(118) 
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Moreover, the subspace H'^^^ QU'^ 7^ {0} turns out to be of infinite dimen- 
sion, i.e., in the inner space TL'^^'^ there exist infinitely many vector-functions 
y orthogonal to all waves^^. So, on a beam the local boundary controllability 
does not occur! 

Such a fact should be expected. Indeed if ^ = and B is diagonal, 
the channels do not interact and, for any control /, the second component 
U2( ■ ,T) is supported in the interval Vl'^'^^ filled with slow modes (see Fig 
16a) 



u; («;,T) 









a) {^b) ( c 

Figure 15. Lack of controllability in Q!^^'^ 



Therefore, the vector-function y 



where -0 is an arbitrary scalar 



function supported in 51'^i-^\ fi"^^-^, is automatically orthogonal to the waves 
(see Fig 16a) and hence do not belong to the reachable set. 

The same effect occurs if the channels interact, what can be explained 
as follows. For any wave u^ {■ ,T), the precursor U2 ( • , T) \c2T<x<ciT hi the 
slow channel is induced by the head part of the component m( ( • , T) \c2T<x<ciT 
in the fast channel and is determined by this head part. More precisely, there 
exists an operator K^ : L2{c2T,ciT) -^ L2{c2T,ciT) that maps the head 
part to the precursor, the operator being one and the same for all controls: 



Such functions are referred to as unreachable states. 
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see [11] for detail. By this, for the functions ipi, tp2 supported in fl'^'^'^ \n'^^'^ 
we have 



"''■■-'^&))„. 



ciT 



P\J, 



({x,T)\ fMx) 



C2T 



ui{x,T)J ' \Mx) 



dx 



ciT 



C2T 

ciT 



u{{x,T) \ f^^{x) 



C2T 

ciT 



'^U'-u{i.,T))ix)i\M^)r''^ 



Pi u( (x, T) V-i (x) + p2 [k^u{ ( • , T)) (x) V2 {x) 
Pi^i{x)+P2 ((A''^)*^2J (a;) u{{x,T)dx. 



dx 



C2T 



Therefore, taking ipi = — ^ i^'^'^) V'2j we get 



pi 



(»'<-^^''ft 



■H-i^ 



V^i 



for all / e j^^, i.e., f JM ± Z^"^ (see Fig 16b). 

A remarkable fact is that the local controllability can be restored if one 
uses the slow waves. Namely, introduce the reachable set of the form 

Uf := W^T'[''^ = (see (106) for ^ = T ) = 



\-,T) 



f= (f) : fi{t) = Jlit~s)f2is)ds, 0<t<T-^J 



By the definition and properties of the slow waves, each u^{- ,T) is sup- 
ported in fi"^^-^ (see Fig 14b and Fig 15c) and hence we have an embedding 
uf C Ti'^^'^ ■ However this embedding is in fact the equality: 

uf = n"^^ , T>0. (119) 

To prove (119) let us consider an auxiliary problem 

pvtt ~ v^^ + Av,; + Bv = 0, <x < C2T, 0<t <T (120) 

v\t<^=0, (121) 

v\t=T = y, 0<x<C2T (122) 
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with y £ 'H'^^'^ . Such a problem is wch posed. Indeed, it differs from the 
problem (73)-(75) just by the role of variables x and t: in (120)-(122) 
it is natural to regard a; as a time, whereas t plays the role of a space 
coordinate (see Fig 16). Let v = v^{x,t) be the solution; denote / := 



yi(^) 




(l-c^)T 




Figure 16. Auxiliary problem. Controllability in H.'^^^ 



vy{0,t), <t <T. Comparing (120)-(122) with (73)-(75), we easily see 
that uf{x,t) = vy{x,t) for all x € rj'^^r, < t < T and the relation 



uf{x,T) = y{x), xe n"'-^ 



(123) 



holds. Also, since the solution u^ vanishes as i < — (see (121) ), it is a 
slow wave and hence the control /, which this wave is produced by, has to 
belong to the subspace J^^ ' . 

Resuming the aforesaid, we conclude that for any vector-function y g 



■T,T 



Ti.'^^ one can find a unique control f <E Ti' such that (123) is fulfilled. 
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Hence, for such y's, the BCP (117) is uniquely solvable and we arrive at 
(119). Thus, by making use of controls of the class JF; ' we restore the 
local boundary controllability of the system a^ hut on the "slow" intervals 
fi^^"^ only! 

For intermediate times t — ^, let us introduce the "slow" reachable sets 
Uf :— W^J-i . With regard to the delay relation (93), they can be also 
specified a.s Uf ~ {u^{- ,£,)] f € J^i ' }. Since the final moment T in the 
equality (119) is arbitrary, it can be written in the form 

W'^T^- ^ = n"''^ , < C < r , (124) 

which is convenient for what follows. In operator terms, (124) means 
that the control operator maps every subspace J^i ' onto the subspace 
T-C^^^ isomorphically, whereas the BCP (117) for y G W^^^ is solved by 

6.3 Wave basis 

Here, taking into account the specific character of the controllability of 
a beam, we construct the slow wave bases in the reachable subspaces T-C^^^ 
and the corresponding projection operators. 



1. Fix a positive £, < T and choose a basis of controls {/|}°^i in the 

subspace ^i ■ By (124), the corresponding waves {u-^^ {■ ,T)}°°^^ 

constitute a basis of the subspace H'^^^. 

2. Apply to the basis {/|}°^i the Schmidt orthogonalization process 

w.r.t. the bilinear form {C^ f,g)jrT'- 






J^T 



{c^gl 



j^T 






c^-girg^ 



(125) 
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As a result, wc get a new system of controls {gf }^i, which is also a 
basis in J^; and satisfies 

{c'^9i-9f)^^^SM k,l = 1,2,... (126) 

by construction. 

The corresponding waves {wllfe^i , u^. := u^^ {■ ,T) = W^ g\ form 
an orthonormalizcd basis in the subspace Ti^^^ . Indeed, we have 

(see (126)) = 4i fc,Z = l,2,... . 

We say {u|}^j^ to be a slow wave basis in W^^^. 

Of course, from the computational point of view, this orthogonal- 
ization process is more complicated than its scalar analog (see sec 3.3) 
because one deals with the vector-functions. However, as well as for a 
string, to construct the system {g^j^x is in fact to invert the Gram 

matrix < {C'^ f^, fnjrr > for large A^'s. Since C^ is a boundedly 

. ■'J i,j=l 

invertible operator, the inversion turns out to be a well-posed proce- 
dure^^. 

3. The basis {^fl^i C J-i determines an operator Vf , which acts in 
the external space T^ by the rule 

T'ff-f:(c''f,9i)^^9i (127) 



k=i 



and will work in the inverse problem. Note that, for any control /, its 
image Vif belongs to the subspace J^i . Moreover, it can be shown 
that Vi is a skew projection in JF^ onto J^j '^ ^^. 



Such a well-posedness is a specific feature of the one-dimensional (w.r.t. the space 
variable x) problems: in the multidimensional case the situation is much worse and 
complicated (see, e.g., [7]). 
In the mean time, it can be interpreted as the orthogonal projection w.r.t. the inner 



product {C'^f,g)j,T- sec [12]. 
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6.4 Truncation and AF 

As in the previous section, we keep ^ S (0, T] fixed. In the inner space 
Ti,^^^ , introduce the operation P'^^C 



y{x) , X e n"^^ 



{P'^'y)i->-, ^ _^,^ 



that truncates functions onto tlie subinterval Qf^^^ C Vt'^^'^ fiUcd with slow 
waves. It is easy to recognize that P'^^? jg w-^q orthogonal projection in TC^^^ 
onto the subspace Ti'^'^^ . Therefore, expanding over the slow wave basis, we 
can represent the truncated function in the form of the Fourier series: 

oo 

p=^S = E(y'"i)«.,.4- (128) 



k=l 



The truncation projector is related to the above-introduced operator 
(127) as follows: for any / G T^ the equality 

P'^^^W'^f = W'^VU (129) 

is vahd. Indeed, taking y — W'^ f in (128) and recalling that m| :— W^gl, 
we have 

C30 

P'^^^W^f = Y. (^^/' ^^si)^^^, W^gi = (see (113)) 



fe=l 



oo 



W" E (^"^Z ' 5^) .. al = (see (127)) = W^^vU . 
fc=i 



Choose a control / G T^ to be a continuous vector-function and write 
(129) in the form 

P^^^u^i-.T) = u^'/(-,r). (130) 

The truncated wave in the l.h.s. is a piece-wise continuous function, which 
vanishes in ^^^^ \ $7"^^^ and has a jump at the point x = C2^, the value 
of the jump being equal to u^{c2£,,T). On the other hand, by (130) '^'^ 
the truncated wave is also a wave produced by the control Vif G Ti , 
which vanishes for i < P — ^ and has the certain jumps at the moments 



and, eventually, by the controllability (124)! 
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t = r — ^ and t ^ T — —^. The control and wave jumps are related through 

(95). Applying this relation for the control Vf f, we easily arrive at the key 
equalities 



ui{c2£.,T) 









0<e<T, 



(131) 



which represent the values of the wave u-^ {■ , T) on the interval fi^^T gUcj 
with slow waves, through the operator V(. 

As in the scalar case, we call the representation (131) the amplitude 
formula (AF). It represents a wave through the amplitudes of jumps of 
control, the jumps appearing as result of projecting the control on the sub- 
spaces T'^'^ by the operators V^ . The background of the AF is geometrical 
optics; various versions of this formula play the key role in the BC-method, 
including its multidimensional variants [7], [12], [6]. 

6.5 Special BCP 

In this section we deal with the problem (117) with a special r.h.s. y. 
The self-adjointness conditions (76) are assumed fulfilled. 
Consider a Cauchy problem 



- p" + Ap' + Bp ^ , x>Q 

p U=o= a, P |x=o= P 



(132) 
(133) 



with the given vectors a, /? e R^. By standard ODE theory, such a problem 
has a unique smooth solution (vector-function) p — pafj(x). In what follows 
we assume a, f3 to be fixed and write just p{x). 

The analog of the scalar special BCP (55) for the beam has to take into 
account the character of its controllability, i.e., the relation (119). Since the 
waves produced by controls f ^ T^ ' are slow, at the final moment t = T 
they are supported in fi'^^-^. By this, the relevant set up is as follows: to 



T.T 



find a control f ^ T, ' satisfying 



U-' {x,T) = p{x) 



xe n"''^ 



(134) 



By (124) for ^ 



T, such a problem has a unique solution that we denote 



Boundary Control Method 



141 



In the case of a beam, the representation of the function p 



Pi[x) 
P2{x) 



which generahzes (60) and is available for the inverse problem, is the fol- 
lowing: 



P1{C2^,T) 


= 
















{R-y 


+ A«(0) 


T 

K a 


+ 


>c^0 


^i),. 


aim 


P2{c2i,T) 


= 














\k=l 


{R-r 


+ A«(0) 


T 

H a 


+ 


.^P 


9i)^r 


gim 

' 2 


0<C<T 




















t=T-f+0 



(135) 



Let us derive it. 

Applying the AF (131) for f = f^ wc get 



P2{c2^,T)^[{vff)^it) 



CI ■> 

t=T-i+0 



0<^<r, (136) 



where 



Vff ^ 



fe=i 



{c^f,9i) 



J^T 



9i 



(137) 



(see (127)). The coefficients of the series can be found as follows. Let us 
take a smooth control g E Ti ' provided g{0) — g'{0) — g"{0) — 0, so that 
the corresponding wave u^ satisfies (73)-(75) in the classic sense. Such a 
wave is slow: it vanishes at its forward front t — — along with derivatives; 
therefore, at the final moment we have 



u%x,T)\., 



>C2T- 



\C2T,T) = u%{c2T,T) = 0. 



(138) 
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This enables one to justify the foUowing calculations: 

{C^f, g)^^ = (see (113)) ^ (u^^ ( • , T), ««( • , T))^^^^ 



(see 



C2T 

(134), (138))= j {pp{x),u%x,T)) 



dx 




T 



(by zero Cauchy data (74) for the wavew^) = 
PP{x) , j {T -t) uftix, t) dt\ dx = 

C2T 

dt{T-t) j {pix),puUx,t)) dx = (see (73)) = 



T C2T 

Jdt{T-t) J {p{x) , «L(a;, t) - A(xX(a;, t) - B{x)u9{x, t)) dx 



(see (116), (138)) = 

T 

\T-t) [(-p(0),(i?^g)(i)) + (p'(0),3(t)) + (A«(0)p(0),g(0)] dt 



= ( - (i?^) * + A' (0) x^a + x"^^ , .9) _ . (139) 

Thus, we have got the cquahty 



' J^-^ 



{C^f,g)^.^{[-{R'y+A\Q) 



K a + X P , g 



J^T 



(140) 



for classical g's. Since such controls constitute a dense set in T^ , the evident 
limit passage extends (140) to arbitrary g € J^^ ' . 

Taking g = gf, in (140) and returning to (137), we obtain 



{-Pff) it) = 

00 

5:([-(i?-)*+A«(o) 



fe=i 



T 

K a 



+ ^^f^^9i)^^9Ut), 0<t<T. 



(141) 



Summarizing the above considerations, we substitute (141) in (136) and 
arrive at (135). 
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The significance of the formula (135) is that it expresses the fimction p, 
which is an object of the inner space, in intrinsic terms of the outer space. It 
is the fact, which renders this representation relevant to the inverse problem. 

In the scalar case, the solution of the special BCP satisfies the Gelfand- 
Levitan-Krein equation (61). Does there exist the relevant analog of this 
equation for the beam? In other words, can one extend the Sondhi-Gopinath 
approach to the case of 2-velocity system? The answer is affirmative but 
however such an analog turns out to be of rather complicated form. Namely, 
the relation (140) easily implies that the control / = f'^ is a unique element 
of the subspace T, ' , which satisfies the equation 



rfC^f = Vf { [- {R^y + A^Q)] K^a + K^td] (142) 



(see [14]). 



7 Inverse Problem 

7.1 Statement 

The external observer applies controls fit) |(o,2T) a-t the endpoint x — Q 
of the beam and measures u^(0,i) |(o,2T) a-t the same point. As result 
of such measurements, the observer possesses the response operator B?"'" 
(see (112))^^. In accordance with the locality principle, the operator R^^ 
is determined by the coefficients A,B Ijjcit. Therefore, at first sight by 
analogy to the scalar case, the inverse problem should be: given i?^-^, recover 
A,B IfjciT. However, such a statement is not quite natural and has to be 
modified by the following reasons. 

First, let A = 0, S = ( _ , ) . In this case, the wave modes do 

\ U 022/ 

not interact, the beam is decoupled to two independent channels with the 
scalar potentials hn and 622 respectively. Correspondingly, the reply func- 
tion takes the form r = f „ 1 . By the results of Part I, the func- 

V r22j 

tion rii(t) |(o,2T) determines 611 Iqc^t, whereas r22(t) |(o,2T) determines 
^22 lo<:2r- However, such data do not determine 622 in ^^'^ \ tt'^'^^l Hence, 
in the general case of the interacting channels, we should not expect that 
the operator i?^-^ determines A, B in Vl"^^"^ \ Vt'^^'^ uniquely. 



The same information is obtained if one applies the controls I p. i ( ^ 1 and measures 
Ux(0,t), 0<t<2T. 
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Second, the object we are going to recover is a pair of matrix-functions 
A, B satisfying the self-adjointness conditions (76). Such a pair is deter- 
mined by four independent scalar functions (parameters) : the matrix ele- 
ments ai2, fell, &12, &22- In the mean time, the response operator is deter- 
mined by the symmetric reply matrix- function r, which contains only three 
functional parameters rii,ri2,r22 (see (112)). Therefore, to hope for the 
uniqueness of determination is not reasonable and we need to supplement r 
with 1-paramcter data. 

A well- motivated choice is to add the function I, which governs the slow 
waves, and set up the dynamical inverse problem as follows: given the 
response operator R^^ and the delaying function / | / ^2^^^ of 

the system a^, recover the coefficients A,B in £7"^^^. 

7.2 Solving the inverse problem 

The coefficients can be recovered by the following procedure. 

Step 1 Determine the constant matrices ^/p, uj, and the reply matrix 
function r |(o,2T) from (112). Find a(0) from uj. 
Find the operator C^ by (115). 

Step 2 Fix ^ e (0,T]. In the subspace J-^ <Z J-^ construct a C^- 
orthogonal basis of controls {^fj^i (see (125)). 

Step 3 Compose {R'^) by (111). Choose two constant vectors a, f3 G 
R\ 

Find the value p{c2£,) from (135). 

Step 4 Varying ^ € (0,r] and repeating the previous Steps, recover 
the solution p in W^^"^ . Recall that this solution is determined by the chosen 
vectors: p = Pap(x). 

Step 5 Varying properly a, (5 and using the equation (132) in the form 

Ap'ap + Bpc,p = pip 

as a system of equations for the unknown A^ B, determine these coefficients 
in n^-'^ 22_ 

The inverse problem is solved. One more way is to make use of the 
GLK-equation: we can determine the control /'^ from (142), recover p(c2^) 
by (136), and then continue by the Steps 4,5. 



Such a system determines A,B uniquely. Indeed, fix an xq G fi'^^l" As the 
map {a, /3} i— > Papi^o) is already recovered, one can choose nonzero o,/3 such that 
Papi^o) = a-nd then recover A(xq) from the system. Analogously, one can determine 

B{xo). 
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Remark The following additional question arises. We have recovered 
the coefficients in the slow interval fi^^^. In the mean time, in accordance 
with the locality principle, the inverse data i?^"^, / are determined by the 
behavior of the coefficients in the fast interval ^'^^'^ . To what extent do the 
inverse data determine A and B in fl'^^^ \ i^'^^^l The question is answered 
in [12], where it is shown that R^^,l do not determine A,B |n'^i^\n=2T 
uniquely and the character of non-uniqueness is revealed. 



7.3 Visualization of waves 

Besides the determination of coefficients, the BC-mcthod provides a re- 
markable opportunity to see the waves on a beam. Of course, once the 
coefficients have been determined, one can find the waves just by solving 
the forward problem (73)-(75). However, speaking about a visualization we 
mean a certain straightforward way to recover the map f i-^ u^ through the 
inverse data without solving the forward problem. 

Let the external observer choose a smooth control / e JF"^. After the 
Steps 1,2, one can compose the operator Vf , find the control Vi f (sec (127)), 
and recover the part u^ {■ ,T) |j2C2^ of the wave u^ {■ ,T) by the amplitude 
formula (131). Moreover, replacing in the AF the control / by the delayed 
one D-^^*/, the observer can determine uf{ ■ ,t) Iqcst for all t € (0,T] (see 
(69)) and thus visualize an evolution of the wave on the interval fl'^'^"^ . 

Visualization provides the picture of waves, what is much more signifi- 
cant than just determination of coefficients. As is easy to see, such a picture 
determines the coefficients. Indeed, one can visualize u-^^ , u-^^ , • ■ • , u^" for a 
rich enough set of controls /i, /2, • ■ • , /n and then ffiid A, B from the system 
of equations (73) written for m^* . 

Visualization of waves via relevant versions of the geometrical optics 
formulas is the main tool for solving multidimensional inverse problems by 
the BC-mcthod [4], [6], [7]. 

7.4 Finite beam 

As we saw, in the case of the semi-infinite beam the response operator 
R^^ and the delaying function I constitute two independent parts of the 
inverse data. Here we show that, for a finite length beam, certain additional 
measurements at its second endpoint enable one to determine I and recover 
the coefficients everywhere on the beam. 

A beam of a finite length h with the free ^^ second endpoint is described 
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by the system 



putt - Uxx + Aux + Bu^O, 

u\t=Q = Ut\t=Q = 0, 
u\x=0 = /, Ux\x=h = 0, 



0<x<h, 0<t<T (143) 

a; > (144) 

0<t<r. (145) 



Assume that, in addition to ul{Q,t) |o<t<2T7 one can measure the values of 
uf{h,t) |o<t<T- 



Take T = — and choose a control f 



where m — m(t) is an 



arbitrary smooth function. If (and only if) one puts m — l{t) then the 
corresponding wave is slow: 



0. 



«<: 



Such a wave does not manifest itself at the second endpoint and the external 
observer records the rest at x = h for all t G (0,T). In particular, the first 
component of the wave, which is a scalar function^'*, satisfies 



1 Uh,t) = Q, 



0<t<T. 



(146) 




observer 1 

Figure 17. Observations at x — h 



This component does not contain (^-singularities: see Fig 11 c,d 
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It can be shown that the converse is also true: the relation (146) is valid 
only if 

m = l{t), 0<t< M - — jT 

holds. 

The latter fact provides an efficient way to find the delaying function 
[10], [14]. Namely, it is the function /, on which the fimctional 



1 
F[m] := 



2 

m] 



6 
u{- {h,i) 



dt 



defined on smooth functions m{t), < t < (1 — — IT. attains the global 



ci 

minimum, the minimal value being equal to F[l] ~ 0. 

As soon as / is determined, one can recover the coefficients and visualize 
waves by means of the procedures described in the previous section. 

7.5 Comments and remarks 

• If pi = /92, SO that the wave modes propagate with one and the same 
velocity, the forward and inverse problems can be treated in the same 
way. Moreover, such a case is much easier for investigation. The rea- 
son is the following. Scaling the x- variable and seeking for the solution 
in the form u{x,t) = S{x)v{x,t), one can choose the matrix-function 
S such that v satisfies vu — Vxx + Bv = ^^. The properties of the 
dynamical system governed by this equation are quite analogous to 
ones of the scalar string: there are no slow waves, the local control- 
lability (43) does hold. Therefore to modify the procedure of sec 4.2 
for recovering the matrix-function _B is a simple exercise. The theory 
of such systems is well known [1], [18]. 

• If A = and B = B\ the beam equation takes the form putt — u^x + 
Bu = and the beam is described by three independent functional pa- 
rameters &ii,foi2,&22- By this, the determination _R^^ => B becomes 
well motivated. However, in this case, our approach leads to the cer- 
tain nonlinear equations [13] and it is not known, whether the suitable 
linear analog of (142) does exist. It is an interesting and important 
open question. 
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i.e., one can remove the 1-ordor term Aux, what is impossible if p\ ^ P2! 
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• In [12], the characterization of the inverse data (i.e., the necessary 
and sufficient conditions for R'^'^,1 to be the inverse data of a system 
a^) is proposed. We have to notify that the paper contains a minor 
inaccuracy: the term ujf in (100) is missed and therefore the main 
result is vahd under additional assumption A{0) = 0. However, to 
extend the characterization to the general case is not a serious task. 

• One can easily modify our approach to be available for another types 
of the boundary conditions, e.g., for the Neumann control Ux U=o= /• 

• Another approaches to the inverse problem for the beam based on 
the gradient (optimization) methods see in [27], [28]. Elementary 
introduction to the multidimensional BC-method can be found in [6]. 



Bibliography 

[1] S.A.Avdonin, M.I. Belishev. Boundary control and dynamical inverse 
problem for nonself adjoint Sturm-Liouville operator (BC-method). 
Control and Cybernetics, 25 (1996), No 3, 429-440. 

[2] M.I. Belishev. On an approach to multidimensional inverse problems 
for the wave equation. Dokl. Akad. Nauk SSSR, 297 (1987), no 3, 524- 
527. English translation: Soviet Mathematics. Doklady, 36 (1988), no 
3, 481-484. 

[3] M.I. Belishev. The Gclfand-Levitan type equations in multidimensional 
inverse problem for the wave equation. Zapiski Nauch. Semin. LOMI, 
165 (1987), 15-20 (in Russian); English translation: /. Sov. Math., 50 
(1990), no 6, 1940-1944. 

[4] M.I. Belishev. Boundary control in reconstruction of manifolds and 
metrics (the BC method). Inverse Problems, 13 (1997), no 5, R1-R45. 

[5] M.I. Belishev. Dynamical systems with boundary control: models and 
characterization of inverse data. Inverse Problems, 17 (2001), 659-682. 

[6] M.I. Belishev. How to see waves under the Earth surface (the 
BC-method for gcophysicists) . Ill-Posed and Inverse Problems, 
S.I.Kabanikhin and V.G.Romanov (Eds). VSP, 55-72, 2002. 

[7] M.I. Belishev. Recent progress in the boundary control method. Inverse 
Problems., 23 (2007), no 5, R1-R67. 

[8] M.I. Belishev. Boundary control and inverse problems: 1-dimensional 
variant of the BC-method. Zapiski Nauch. Semin. POMI, 354 (2008), 
19-80 (in Russian); English translation: J. Math. Sciences, v. 155 
(2008), no 3, 343-379. 



Boundary Control Method 149 

[9] M.I.Belishev, A.S.Blagovcschcnskii. Dynamical Inverse Problems of 
Wave Theory. SPb State University, St-Petersburg, 1999 (in Russian). 

[10] M.I.Belishev, A.S.Blagovestchenskii, S.A.Ivanov. Erratum to "The 
two- velocity dynamical system; boundary control of waves and inverse 
problems [Wave Motion 25 (1997) 83-107]". Wave Motion, 26 (1997), 
99. 

[11] M.I.Belishev, S.A.Ivanov. Boundary control and canonical realizations 
of two- velocity dynamical system. Zapiski Nauch. Semin. POMI, Til: 
18-44, 1995 (in Russian); Enghsh translation: J. Math. Sci. (New 
York) 87 (1997), no. 5, 3788-3805 

[12] M.I.Belishev, S.A.Ivanov. Characterization of data of dynamical in- 
verse problem for two-velocity system. Zapiski Nauch. Semin. POMI, 
259 (1999), 19-45 (in Russian); English translation: J. Math. Sci., 109 
(2002), no 5, 1814-1834. 

[13] M.I.Belishev, S.A.Ivanov. On uniqueness "in the small" in dynamical 
inverse problem for a two-velocity dynamical system. Zapiski Nauch. 
Semin. POMI, 275: 41-54, 2001 (in Russian); Enghsh translation: J. 
Math. Sci. (N. Y.) 117 (2003), no. 2, 3910-3917 

[14] M.I.Belishev, S.A.Ivanov. Recovering the parameters of the system 
of the connected beams from the dynamical boundary data. Zapiski 
Nauch. Semin. POMI, 324: 20-42, 2005 (in Russian); English transla- 
tion: J. Math. Sci. (N. Y.) 138 (2006), no. 2, 5491-5502 

[15] M.I.Belishev, T.L.Sheronova. The boundary control method in dy- 
namical inverse problem for inhomogeneous string. Zapiski Nauchn. 
Seminarov LOMI, 186 (1990), 37-49 (in Russian). English translation: 
J. Math. Sci., 73 (1995), no 3, 320-329. 

[16] M.I.Belishev, A.V.Zurov Effects associated with the coincidence of ve- 
locities in a two- velocity dynamical system. Zapiski Nauchn. Seminarov 
LOMI, 264 (2000), 44-65 (in Russian). Enghsh translation: J. Math. 
Set., Ill (2002), no. 4, 3645-3656. 

[17] A.S.Blagovestchenskii. On a local approach to the solving the dy- 
namical inverse problem for inhomogeneous string. Trudy MIAN 
v. A. Steklova 115 (1971), 28-38 (in Russian). 

[18] A.S.Blagovestchenskii. Inverse Problems of Wave Processes. VSP; 
Netherlands, 2001. 

[19] I.M.Gelfand, B.M.Levitan. On the determination of a differential equa- 
tion from its spectral function. Izv. Acad. Nauk SSSR, 15 (1951), 
309-360 (in Russian). 



150 M.I. Belishev 

[20] B.Gopinath and M.M.Sondhi. Determination of the shape of the human 
vocal tract from acoustical measurements. Bell Syst. Tech. J., July 
1970, 1195-1214. 

[21] B.Gopinath and M.M.Sondhi. Inversion of the Telegraph Equation and 
the Synthesis of Nonuniform Lines. Proceedings of the IEEE, vol 59, 
no 3, March 1971, 383-392. 

[22] R.Kalman, P.Falb, M.Arbib. Topics in Mathematical System Theory. 
New-York: McGraw-Hill, 1969. 

[23] M.G.Krcin The solving of the Sturm-Liouville inverse problem. Dokl. 
Akad. Nauk SSSR, 76 (1951), no 1, 21-24 (in Russian). 

[24] M.G.Krcin. On inverse problems for a non-homogeneous string. Dokl. 
Akad. Nauk SSSR 82 (1952), no 5, 669-672 (in Russian). 

[25] M.G.Krcin. On a method for the effective solution of the inverse 
boundary- value problem. Dokl. Akad. Nauk SSSR, 95 (1954), no 6, 
767-770 (in Russian). 

[26] J. -L. Lions. Controle optimal de syst emes gouvern 'es par les equations 
aux d criv 'ees partielles. Dunod Gauthier-Villars, Paris, 1968. 

[27] A.Morassi, G.Nakamura, M.Sini. An inverse dynamical problem for 
connected beams. European J. Appl. Math, 16 (2005), no 1, 83-109. 

[28] A.Morassi, G.Nakamura, M.Sini. A variational approach for an inverse 
dynamical problem for composite beams. European J. Appl. Math, 18 
(2005), no 1, 21-55. 

[29] D.L.Russell. Controllability and stabilizability theory for linear partial 
differential equations. SIAM Review, 20 (1978), no 4, 639-739. 

[30] V.S. Vladimirov. Equations of Mathematical Physics. Translated from 
the Russian by A.Littlewood, Ed. A.Jeffrey, Pure and Applied Mathe- 
matics, 3 Marcel Dekker, Inc, New-York, 1971. 

[31] A.V.Zurov Effects associated with the coincidence of velocities in a 
two-velocity dynamical system. Zapiski Nauchn. Seminarov LOMI, 
297 (2003), 44-65 (in Russian). Enghsh translation: J. Math. ScL, 127 
(2005), no. 6, 2364-2373. 



Dynamic Characterization and 
Damage Identification 



Fabrizio Vcstroni and Annamaria Pau 

Dipartimento di Ingcgnoria Strutturale e Geotecnica 
Universita di Roma La Sapienza, Roma, Italy 

Abstract This chapter presents selected cases of modal parameter 
evaluation and damage identification, privileging the applied as- 
pects. The identification of modal parameters is discussed through 
three experimental cases of masonry structures: a civil building, 
a monumental structure and a nineteenth century railway bridge. 
Modal parameters are response quantities that are sensitive to dam- 
age, whose variation with respect to an undamaged state can be 
exploited in view of damage detection and identification. More- 
over, the variations of modal quantities can be used for updating 
the structural model, leading to the best estimate of model mechan- 
ical parameters in the undamaged and damaged conditions. This 
provides information on the damage location and severity. The tech- 
nique of damage identification based on model updating is applied 
to laboratory experimental tests on a steel arch. 

1 Introduction 

Within the inverse problems, identification of structural models is a very 
meaningful subject; indeed in this book different Chapters are devoted to 
the matter. Notwithstanding great improvements have been made in mod- 
elling the dynamic behaviour of structural systems, due to different reasons 
- mainly, simplifying assumptions arc generally introduced and knowledge of 
the actual parameter values can only be approximate - it is thoroughly felt 
that the experimental data obtained from the real structure can be conve- 
niently used to make the models more reliable. The identification of models 
with limited initial information or realistic constraints on the unknowns to 
be identified, referred to as reconstruction problems, are very complex and 
only in few cases a unique solution is guaranteed (Gladwell, 1984; Gladwell 
et al., 1987; Gladwell and Morassi, 1995; Dilcna and Morassi, 2010). With 
respect to the indeterminacy and ill-conditioning of the problems, it seems 
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more realistic to use all the a priori information coining from the theory of 
materials and structures for building an initial model of the structure, then 
to exploit the experimental data, when available, to improve and update 
the model. To this goal the best use of experimental data is achieved by 
referring to a system identification technique that allows the suitable corre- 
lation of a priori analytical information with test results, bearing in mind 
that frequently little information is available from experimental tests, as 
few points can be instrumented, and the number of conditions that can be 
investigated is limited (Natke, 1982, 1991; Mottershead and Friswell, 1993; 
Capecchi and Vestroni, 1993, 1999; Ghanem and Sture Eds., 2000; Morassi 
and Vestroni, 2008). 

Moreover, model identification is a fundamental step of damage evalu- 
ation and structural monitoring (Brownjohn et al., 2003; Megalhaes et al., 
2008; Morassi and Vestroni, 2008). In this context a certain popularity was 
gained by parametric models that have a well-established structure and the 
only object of identification is a number of their parameters, whose optimal 
estimate takes advantage from the experimental information coming from 
the dynamic response to external excitations. 

Among parametric models, here, attention is focused on two important 
classes: modal models and physical models, such as finite clement models. 
Modal models are based on the assumption that the system dynamics can 
be represented by a series of uncoupled second-order differential equations 
with constant coefficients, and the modal parameters are the unknowns to 
be identified. In the physical models, a wider use is made of a priori infor- 
mation, and identification is mainly devoted to model updating by adjusting 
the values of the selected structural parameters. If the model is accurate 
in reproducing the structural behavior on the whole, these parameters will 
effectively maintain the physical meaning. 

The choice between physical or modal models thus depends on whether 
it is more reasonable to formulate a realistic analytical model able to re- 
produce the structure dynamics with accurate values for the geometrical 
and mechanical characteristics, or whether it is possible to develop a suffi- 
ciently wide experimental investigation, where a high number of response 
quantities are measured. Notwithstanding the greater effectiveness of the 
physical models, which remain the final goal of the identification process, 
modal models are always the first step of any dynamic identification pro- 
cedure, because their parameters (frequencies, eigenvectors and damping) 
are the basic dynamical characteristics of a structure. In particular, the 
experimentally evaluated modal parameters, when they are a complete set, 
make it possible to identify modal models that are predictive models in the 
direct problem, that is, they are able to furnish the response of the struc- 
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turc to any known excitation. When the experimental response is measured 
in a few points, and therefore only an incomplete set of modal eigenvalue 
components is determined, it is not possible to obtain an effective modal 
model of the structure, but the experimental values of the modal parameters 
determined are still important for at least two reasons: (a) their variation 
entails a modification of the mechanical characteristics of the structure, 
thus they are an efficient tool to detect damage and (b) they can be used 
as experimental data to identify the physical model. 

Finite clement models take into account all the available a priori infor- 
mation on the continuum models, the material behaviour and the external 
and internal constraints. The response to a given excitation, furnished by a 
finite element model, is governed by a set of coefficients that are strictly re- 
lated to the physical quantities, such as, the coefficients of mass or stiffness 
matrices. These physical quantities represent the parameters of the models. 
The estimation procedure of these quantities, by using the experimental 
data, is generally referred to as structural identification or model updat- 
ing. Uncertainties regarding system modelling are restricted to the values 
of some physical parameters governing the response of the model (Motter- 
shead and Friswell, 1993). The optimal values of these are determined so 
as to minimize the difference between the measured and predicted response 
quantities. However, physical models are much more involved and, due to 
their theoretical structure, are more rigid with respect to modal models in 
fitting the given experimental results; thus, parameter identification is more 
difficult and onerous. Just as an example, it is difficult to reproduce the 
sequence of the modes with the exact ratios among the frequencies, which 
is an intrinsic characteristic of a structure, mainly due to the unavoidable 
modelling errors. This has led to a more widespread use of modal models 
in practical engineering, even though physical models are more attractive, 
as they are more robust and refined. The identification of the finite element 
model of a structure remains an important goal of structural identification, 
as it is typically used in direct problems. Moreover, it is the only model 
that is potentially able to give local information on the structural proper- 
ties, which is crucial to damage detection (Friswell and Mottershead, 1995, 
1998; Capecchi and Vestroni, 1999; Vestroni and Capecchi, 2003; Morassi 
and Vestroni, 2008). 

Damage identification based on finite element models is a specific model 
updating that uses the variation of the measured data, like modal quantities, 
from the undamaged to damaged conditions. It is important to distinguish 
between two different situations: (a) diffused damage as a consequence of 
an infrequent event (strong wind, earthquake, explosion, etc.): this case is 
similar, but more complex than the initial identification of the structure. 
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where it is reasonable to consider some regularity in the distribution of the 
mechanical characteristics, and the inverse problem exhibits a high number 
of unknowns, frequently leading to an indeterminate problem (Teughels and 
De Roeck, 2004; Morassi, 2007; Cruz and Salgado, 2008); (b) concentrated 
damage related to a local phenomenon of deterioration or overstress (cor- 
rosion, crack) , in this case damage is located in a few sections and modifies 
few stiffness characteristics, which constitute a small number of unknowns, 
frequently leading to an over-determined problem (Vestroni and Capecchi, 
1996; Capecchi and Vestroni, 2000; Pau et al., 2010). 

In this Chapter, case-studies are presented to illustrate procedures and 
difficulties in modal identification and model updating for damage evalua- 
tion. As far as modal models are concerned, different methods are summa- 
rized and employed for structures excited by harmonic forces, transmitted 
by a mechanical shaker, and ambient excitation induced by traffic. As far 
as damage identification is concerned, the case of a damaged arch due to a 
crack is dealt with, which involves model identification in the undamaged 
condition and updating of the model to fit the response of the damaged 
structure. A procedure, which takes advantage of the peculiarity of the 
damage identification problem, is applied to an experimental case. 

2 Identification of Modal Parameters 

In structural dynamics, the multidegree-of-freedom systems represent dis- 
crete or discretized continuous systems. The governing equations of the 
motion of a linear system with n-degrees-of-freedom can be written in ma- 
trix form (Craig, 1981; Meirovitch, 1997): 

Mu{t) + Cu{t) + Ku{t) = p{t) (1) 

where M, C and K are respectively the mass, damping and stiffness matri- 
ces, with dimensions n x n, and p(i) is the nxl force vector. If the damping 
matrix C is a combination of M and K (classical damping), the damped 
system will have real eigenvectors coincident with those of the undamped 
system, so that the equations of motion can be decoupled by describing the 
displacement motion in terms of the eigenvectors weighted by the modal 
coordinates q(i): 

u(t) = *q(t)-^*,,q,(t). (2) 

r 

By substituting this expression in the equations of motion, then premulti- 
plying by the matrix ^^ and using the orthogonality eigenvector conditions, 
n uncoupled modal equations are obtained: 

CJr (t) + 2C,r^rq_r {t) + krQr {t) ~ Pr {t) r = 1, . . . n. (3) 
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Hence, the system behaviour is represented as a Hnear superposition of the 
response of n single-degree-of-freedom systems. A system with n degrees 
of freedom has n natural frequencies ujr and n related mode shapes $r, 
along with the modal damping ratios C^- These quantities are the dynamic 
characteristics of the structure and can be determined from the analysis of 
the response. 

In a single-degree-of-frccdom system, the Fourier transform of the re- 
sponse f/(w) is connected to the Fourier transform of the forcing function 
F{uj) by the frequency response function H{lu) (FRF), according to the lin- 
ear relationship U{uj) — H{ui)F{uj). In n-degrees-of-freedom systems, the 
FRF is an n X n matrix, connecting the displacement at node i caused by 
a force applied at node j: 



FA^) £^fc,[(l-r2) + z2C,.r 



r=\ 



where r^ ~ to jiOr is the ratio of the forcing frequency w to the rth system 
frequency uJr- Once the registration of the response is available, there are 
various procedures in the time or frequency domain to determine the modal 
parameters of the structure (Maia and Silva, 1997). Some of these proce- 
dures require the FRFs, that is, the response to a measured input (harmonic 
or impulsive force) must be known. Other methods, such as the peak pick- 
ing or the singular value decomposition, are based on the analysis of the 
output-only response. The following paragraphs simply describe these three 
methods, which are used in the applications presented. 

2.1 Frequency Response Function Method 

The method furnishes an optimal estimate of modal parameters based 
on the minimization of the error between the experimental FRF*^ and the 
related analytical quantity expressed by equation (4): 

e^H,M-Ht,W). (5) 

The used algorithm is based on a multimodal estimate in the frequency 
domain, following the pioneering work by (Goyder, 1980). In the neigh- 
bourhood of the rth resonance, the inertance H^iuS) can be regarded as the 
sum of two terms: 
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The former is the prevaihng resonant term and the latter is the contribu- 
tion of all the other modes. By comparing the experimental and analytical 
inertance functions, the error at the frequency ujk around ujr can be defined: 



e, = H,,{u;,) - H\f{u,) = 77; - C, (7) 

■' a,- — w^ + ibrUJk 

where the constant Ck is given by the difference between the experimental 
value of the FRF and the modal contributions of all the other modes s 7^ 
r, and the following modal quantities are introduced as variables of the 
problem: 



,2 



w^ 



Zi^^iO^ , Cjj> (pi'^Cpjy. \pj 



The error function can be linearized with respect to the assumed variables 



by means of a suitable weight function wj. ~ — l/(a,. ~ w^ + ib,.uJk) 



Ck = —„ ^ , 'J,./"" Cfe = w^WfeCijv + WfeCfe (flj. - Wfc + ihrU]^ . (9) 

The procedure follows an iterative scheme: at the /ith iteration, wu and Ck 
are determined by the values at the ft, — 1 step. The objective function is 
obtained by summing the square error in a frequency range in the neighbor- 
hood of ojr ■ On the basis of the least-squares method, the modal unknowns 
ar,br,Cijr arc determined. At each step, this procedure is extended to all 
meaningful resonances that appear in the response, and, at the end of the 
process, all the modal parameters in the range of frequencies investigated 
are determined. 

2.2 Peak Picking 

This method is very often used for its simplicity in analysing the ambient 
vibration response, when the input is unknown (Bendat and Piersol, 1980). 
The ambient vibration response of a structure cannot be predicted by de- 
terministic models, within reasonable error. Each experiment produces a 
random time-history that represents only one physical realization of what 
might occur. In general, the response x{t) of the structure to ambient ex- 
citation is recorded for a very long time, even for hours, which enable to 
cut the random process x(t) into a collection of subrcgistrations Xk{t) which 
describe the phenomenon. 

The Fourier Transforms of the fcth subrcgistrations of two random pro- 
cesses Xk{t) and j/fc(i) are respectively: 



Dynamic Characterization and Damage Identification 157 

^fc(/,r)= / Xfc(i)exp-'2-/*dt (10) 

Jo 

Yk{f,T)^ f yk{t)cxp~^'^fUt. (11) 

Jo 

The auto (or power) spectral density (PSD) and cross-spectral density 
(CSD) and related coherence function between the two random processes 
are respectively: 

^,.(/)= hm ii?[|Xfc(/,r)n (12) 

1 — >CXD 1 

S,yif)^lim ^E[Xk{f,T)Yk{f,T)] (13) 

1 — >oo 1 

^""^^^ = sjf)Syyif) ^^^^ 

where the symbol E[.] indicates an averaging operation over the index k 
and the overbar denotes complex conjugate. 

Let us now assume that x(t) is the input and y{t) is the output. The auto- 
spectral and cross-spectral density functions satisfy the important formulae: 

Syyif) = \H.vif)\'S,M) S^yif) - H,y{f)SUI) (15) 

where Hxy{f) is the frequency response function (see equation (4)). The 
simple peak picking method is based on the fact that the autospectrum 
(15i), at any response point, reaches a maximum either when the excitation 
spectrum peaks or the frequency response function peaks. To distinguish 
between peaks that are due to vibration modes as opposed to those in the 
input spectrum, a couple of criteria can be used. The former concerns the 
fact that in a lightly damped structure, two points must oscillate in-phase 
or out-of-phase. Then, the cross spectrum (162) between the two responses 
provides this information, which can be used to distinguish whether the 
peaks are due to vibration modes or not. The second criterion uses the 
coherence function (14), which tends to peak at the natural frequencies, as 
the signal-to-noise ratio is maximized at these frequencies. 

2.3 Singular Value Decomposition 

The second method referred to also relies only on the response to am- 
bient excitations (output only). The method is based on the singular value 
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decomposition of the response spectral matrix (Brincker et al., 2001), ex- 
ploiting the relationship: 



Syy (oj) ~ H (u;) Sxx (w) H (cu) 



(16) 



where Sxx{uj) {Rx R, R number of inputs) and Syy{uj) (M x M, M number 
of measured responses) are the input and output power spectral density 
matrices, respectively, and H(w) is the frequency response function matrix 
{M X R). 

Supposing the inputs at the different points arc completely uncorrelated 
and white noise, Sxx is a constant diagonal matrix, independent of w. Thus: 



whose term jk can be written, by omitting the constant 5, as: 
Syyjk (t^) 




r=l \p=l P 



(17) 



(18) 



In the neighbom'hood of the ith resonance, the previous equation can be 
approximated by: 



Syyjk {(^) 



r— 1 * 



Sfei'/ 



4>ji4>ki 



R 



:^Xi 



(A? 



■^^)(A| 



W2) A- 



(19) 



By ignoring the constant ^ 0^^, Syy can thus be expressed as the product 



of the three matrices: 



r=l 



S2/y(c^) = *A,*^ (20) 

which represent a singular value decomposition of the matrix Sj/j/, where: 

1 0... 

0... 











0... 



(21) 



This is valid in the neighbourhood of every natural frequency of the system, 
that hence emerges as a peak of the first singular value. The first column of 
the matrix # contains the first singular vector, which in the neighborhood 
of the jth resonance coincides with the ith eigenvector. This occurs at 
each resonance, when the prevailing contribution is given by the related 
mode. This procedure has recently had great diffusion mainly for in situ 
experimental tests and has also been implemented in commercial codes. 
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2.4 The S. Sisto School 

S. Sisto school is a natural stone masonry building in L'Aquila. It con- 
sists of a main two-storey rectangular structure about 10 m high. The plan 
of the main building is rectangular, with sides having a length of about 12 m 
(Figure 1). The height of the storeys is irregular. A steel beams and hollow 
tiles mixed floor divides the different storeys. The building is structurally 
non-symmetric because of the eccentricity of the staircase. The sensors 
were located on the first and second floors, as shown in Figure 1, where 
their positions and measurement direction arc indicated as (C1Y-C12X). 
The structure was excited with a shaker located in four different positions 
and directions (V1X-V4Y), also shown in Figure 1. Forced harmonic vi- 
bration tests were performed; the rotating frequency of the actuator was 
increased from 5 to 15 Hz with a step increment of 0.05 Hz. The modal 
parameters were identified by using the iterative procedure defined by the 
minimization of the error function (9) . 
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Figure 1. Building plans and locations of the accelerometers (C1Y-C12X) 
and of the shaker (V1X-V4Y) at ground level (a), 5.40 m (b), 9.50 m (c) 
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Figure 2 shows the frequency response functions (FRFs) obtained at 
CSX and C9Y from the four different locations of the shaker. Two groups 
of peaks emerge: the first represents three global modes involving the whole 
structure; the second contains four peaks corresponding to the higher modes, 
with local deformations of the structural elements. When the position of the 
shaker is varied, some differences in natural frequencies and mode shapes 
are observed. The frequencies are reported in Table 1 together with the 
related damping coefhcients, while the mode shapes are depicted in Figure 
3. Mode 1 is a bending mode along the Y axis, mode 2 is approximately 
a bending mode according to the X axis and mode 3 is a torsional mode. 
Both modes 1 and 2 involve a certain amount of torsion. This is due to the 
imperfect structural symmetry for the presence of the staircase. 
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Figure 2. FRFs at CSX and C9Y for different locations of the shaker 



Table 1. Natural frequencies [Hz] and damping ratios (in parentheses) ob- 
tained with the different shaker locations 



Shaker location 


Mode 1 


Mode 2 


Modes 


VIX 


5.95 (2.1%) 


6.41 (2.2%) 


7.65(1.6%) 


V2Y 


- 


6.25 (2.4%) 


7.76 (1.6%) 


V3X 


6.10 (2.1%) 


6.41 (1.8%) 


8.17(1.2%) 


V4Y 


- 


6.23 (2.2%) 


7.76 (1.6%) 
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The differences in the observed frequencies may be ascribed to nonlin- 
earities, which often occur in masonry structures, even at a low level of 
vibration and to the high damping coefficient connected to the vicinity of 
modes. Nevertheless, a procedure for the updating of the physical model 
requires that only one frequency is defined for each mode, and then a selec- 
tion is made by observing the time-histories on the basis of their consistency. 
This brings to the definition of the frequencies reported in bold in Table 1, 
related to V3X for the first frequency, to V4Y for the second and to V4Y 
and V2Y for the third. Figure 4 shows the experimental frequency response 
functions and those obtained by using the identified modal parameters. A 
satisfactory agreement is reached; the fitting is better for the first group 
of modes than for the second one and for the anti-resonances. The modal 
parameters, experimentally evaluated, are then used to update the finite 
element model of the structure; the details can be found in (De Sortis et al., 
2005). 
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Figure 3. Comparison between the first three identified mode shapes for 
different shaker locations 



2.5 The Flavian Amphitheatre 

Known worldwide as the Colosseum, the Flavian Amphitheathre stands 
in the center of Rome, near to an underground line and heavy road traffic. 
The amphitheatre is elliptical in shape with major and minor axes being 
188 m and 156 m in length, respectively; its external wall is 50 m in height. 
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Figure 4. Comparison between experimental and identified FRFs for two 
sensors and shaker positions 



The structure consists of a series of dry masonry travertine radial walls 
connected by mixed travertine/tuff concentric annular walls and by inclined 
concrete vaults. The heterogeneity of the soil in the foundations and various 
earthquakes have caused settlements that have led to collapses, which have 
given the Colosseum its present shape, wherein the original symmetry is 
partially lost. 

Different kinds of tests were performed on the Flavian Amphitheater. A 
detailed description of all the experimental investigations can be found in 
(Pau and Vestroni, 2008). Triaxial recordings enabled the characterization 
of the excitation and vibration level. Figure 5, as an example, compares 
the power spectral densities (PSD) of the accelerations at the base and on 
the upper level of the monument (L4, about 40 m high). This figure shows 
that the frequency content of the vertical and tangential accelerations at 
the base and level L4 have similar features. By contrast, the power spectral 
densities of the radial acceleration reveal the filtering effect of the struc- 
ture. In fact, the frequency content of the radial response at the base is 
similar to that in the other directions, but easily distinguishable peaks ap- 
pear at the upper levels. The natural frequencies are in fact expected in this 
band, which is actually investigated more in detail with radial recordings. 
These tests have been performed by measuring accelerations along seven 
vertical sections with six accelerometers placed at different heights (levels 
L1-L6) on the external wall. When moving from one vertical to another, 
one extra accelerometer was kept on the previous line to track the phase. 
Impact vibration tests have also been carried out on the columns, using an 
instrumented hammer, to determine the material properties from the wave 
propagation velocity. 
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Figure 5. PSDs of accelerations at level L4 (a) and base (b) 



Experimental modal analysis using ambient vibrations must face the 
problem of input forces being unknown. The key condition is that the 
excitation must have a flat spectrum in the frequency band around the fre- 
quencies to be estimated. This is verified, and then the natural frequencies 
are determined by simply peak picking from the PSDs. In order to obtain 
further evidence of the results, the coherence function 7 is also observed, 
which also tends to peak at natural frequencies. Figure 6 (a) and (b), respec- 
tively, report the cross-spectral density (CSD) and coherence of the radial 
accelerations at a section. Figure 6(a) shows a couple of sharp resonance 
peaks at 1.03 Hz and 1.49 Hz respectively, whose amplitude increases from 
LI (Sgi) to L5 (upper level, 855 ). At these frequencies, the coherence tends 
to peak, although not reaching the unit value, because of the smallness of 
the response amplitude (Figure 6b). Other peaks are observed in the range 
1 — 2 Hz: in all cases their amplitude increases from LI to the upper levels 
and the coherence reaches a maximum. The values of these frequencies are 
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reported in Tabic 2, which also indicates the sections where these resonance 
peaks are observed. Their values are summarized in the last column of the 
table. 




Figure 6. CSDs Sgj between the response at L6 and Lj (a) of radial accel- 
erations and related coherence (b) 



Table 2. First experimental frequencies [Hz] observed at different sections 



section 


24 


27 


32 


39 


47 


51 


53 


fe 


LI 


1.03 














1.03 


L2 












1.30 


1.30 


1.30 


L3 


1.49 


1.48 


1.47 




1.49 


1.49 




1.49 


L4 






1.59 


1.62 




1.60 




1.60 


L5 






1.65 




1.67 






1.66 


L6 






1.75 


1.73 


1.76 






1.75 



A finite element model of the structure was also built, accurately repro- 
ducing the present geometry and using the material characteristics derived 
from the literature. The comparison between experimental /g and numer- 
ical frequencies fa and the related mode shapes is reported in Figures 7 
to 9, which show that experimental frequencies are approximately half of 
the numerical ones. If the ratios between Young's modulus and mass den- 
sity, E/ p, of the different materials are homogeneously reduced in order to 
minimize the errors between /e and /„, these frequencies are brought into 
satisfactory agreement, according to Table 3. The reduction required for 
the ratio E/ p is considerable, however, a similar result has already been ob- 
served in other cases of similar ancient Roman masonries. Furthermore, the 
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updated value is strongly related to the experimental result obtained by the 
impact tests on the columns. In these tests, the propagation velocity of the 
pressure waves Cp was experimentally evaluated: the arrival time of a wave 
generated by an impact at one point was observed at different points and a 
mean experimental value of 1220 m/s was obtained for Cp. The parameter 
Cp is related to the mechanical properties of the medium, E, p and Pois- 
son's coefficient v according to: Cp = \/E{\ — v)/[p{l + i'){\ — 2v)]. The 
experimental estimate of Cp is 0.42 times the value furnished by the previous 
formula, using the same material properties, formerly assumed in the finite 
clement model. This result requires a reduction in the ratio E / p, similar to 
that obtained by the process of minimisation of the differences between the 
natural frequencies. 



Table 3. Experimental and analytical frequencies of the Colosseum [Hz] 
and related errors 



mode 


1 


2 


3 


4 


5 


6 


le 


1.03 


1.30 


1.49 


1.60 


1.66 


1.75 


fa 


1.13 


1.14 


1.51 


1.54 


1.74 


1.76 


100- i^-r^^i 


9.71 


12.31 


1.34 


3.75 


4.82 


0.57 
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Figure 7. Comparison between I'** and 2'"^ analytical and experimental 
mode shapes 
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Figure 8. Comparison between 3'"'' analytical and experimental mode 
shapes and 4*'' experimental mode 



4"' mode 



5"' mode 




5"' mode f,= 3.17Hz 

L6, . ^ 





FEM 



L6 
L5 
L4 



f-= 3.58 Hz e* mode 



24 27 32 39 47 51 53 24 27 32 39 47 51 53 

f5= 1.66 Hz experimental f^= 1.75 Hz 

Figure 9. Comparison between 4*'* and 5*'' analytical and 5*^* and 6*^ 
experimental mode shapes 
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In order to compare the mode shapes, the values of the modal assurance 
criteria (MAC) between the analytical and experimental mode shapes were 
calculated (Table 4). The MAC is defined as the scalar product between 
two vectors, representing an analytical and an experimental mode, each one 
normalized with respect to its modulus. The first two modes arc clearly 
identified, as appears from the high value of their MAC coefficients and 
also from their images in Figure 7. The experimental mode shapes are 
represented by the contour plots of the radial components. The accordance 
between the third modes is satisfactory, while the fourth experimental mode 
does not exhibit a similarity to any analytical mode (Figure 8). The fifth 
and sixth experimental modes look like the fourth and fifth analytical ones, 
considering that high MAC values for higher modes in the out-of-diagonal 
terms are mainly due to the small number of measurement points (Figure 
9). 

Table 4. MAC coefficients between analytical and experimental mode 
shapes 



a\e 


1 


2 


3 


4 


5 


6 


1 


0.93 


0.10 


0.27 


0.03 


0.01 


0.03 


2 


0.10 


0.89 


0.11 


0.02 


0.00 


0.06 


3 


0.14 


0.12 


0.69 


0.23 


0.49 


0.51 


4 


0.12 


0.05 


0.28 


0.32 


0.64 


0.50 


5 


0.13 


0.02 


0.69 


0.20 


0.42 


0.62 


6 


0.15 


0.09 


0.37 


0.47 


0.45 


0.39 



2.6 The Vallone Scarpa Bridge 

The Vallone Scarpa bridge was built at the end of the nineteenth century 
and is located along the Roma-Sulmona railway line, which crosses the 
central Italian region of Abruzzo. It is a masonry arch viaduct with thirteen 
bays, each with a span of 10 m. The piers arc about 9 m in height. The 
plan has a radius of curvature of 400 m; the slope of the line is 2.7 %. 

The ambient vibrations of the bridge were recorded using two different 
arrangements of accclcromcters. The measurement directions were: in the 
plan of the deck, tangent to the bridge axis (L, longitudinal) and related 
orthogonal line (T, transverse); and the vertical direction (V) along the 
viaduct axis. In the first setup, transverse sensors were placed on the top 
of each pier, together with biaxial sets (T, L) located at the middle of each 
span. In the second setup, triaxial sets of accelerometers (L,T,V) were 
placed on the deck edges of the three central bays. 
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3.883 Hz 




4.792 Hz 



f= 5.349 Hz 



Figure 10. Experimental frequencies and mode shapes of the Vallone 
Scarpa bridge. 



The natural frequencies and mode shapes were determined from the am- 
bient vibration responses by using the previously described technique of 
singular value decomposition. Using the first arrangement of sensors, the 
first five frequencies were determined, showing that the mode shapes are 
bending modes of a beam over elastic supports in the plane of the deck, as 
shown in Figure 10, which depicts the first five modes of the structure and 
reports the related frequencies. The main component of the mode shape is 
transverse, as shown in Figure 11, which reports, for the sake of brevity, a 
comparison between the longitudinal and transverse components of the first 
and second mode shapes only. The measurements performed with the sec- 
ond arrangement of sensors provided similar results, but showed in addition 
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that the identified modes present a shght rotational component, which is 
pointed out by the fact that the deck edges oscillate out-of-phase. This is 
shown in Figure 12, only for modes 1 and 2 for brevity. Furthermore, in the 
second arrangement of sensors there was a slight variation of frequencies, 
but an inversion between the order of modes 1 and 2 was found (see Table 
5). According to these measurements, the first mode presents one node, 
while the second one does not have any. This phenomenon is related to the 
closeness between the two frequencies of the arch in the horizontal plane 
and to the possible slight variation of mechanical parameters. A complete 
explanation of the phenomenon would require the repetition of the mea- 
surements and verification of their robustness with regard to the ambient 
conditions. 



mode 1 



mode 2 




10 15 

sensor n' 



Figure 11. Comparison between longitudinal and transverse components 
of the first and second experimental modes. 



3 Damage Identification 

In structural identification problems, the system model is characterized by 
the structural parameters a;, which are to be identified by the measured 
output z* to the input /*. The system with generic parameters x, excited 
by the true input /*, does not reproduce the exact output z* , but responds 
with the generic output z. The challenge consists in identifying the correct 
values of the parameters x* , so as to give the response values close to the 
measured ones. 

An important application of the identification techniques is damage eval- 
uation. When dealing with damaged structures, a modification of the re- 
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mode 2 



6 8 10 
sensor n° 




6 8 10 
sensor n° 



Figure 12. Comparison between longitudinal, transverse and vertical com- 
ponents of experimental modes 1 and 2. 

Table 5. Frequencies of the Vallone Scarpa bridge determined with the two 
sensor arrangements 



mode 


1 


2 


3 


4 


5 


first setup 


3.869 


3.883 


4.235 


4.792 


5.349 


second setup 


3.854 


3.839 


4.191 


4.909 


5.510 



sponse due to damage is observed. The same input /* acting on the un- 
damaged structure with parameters x^ produces an output z^ which is 
different from the output z^ of the damaged structure with parameters x^ . 
Variations in the behaviour of a structure can be associated with the decay 
of the system mechanical properties (Shen and Pierre, 1990; Morassi, 1993). 
The response in a damaged state can be represented as a deviation from the 
undamaged state z^ + Az^ due to a variation of the parameters x^ + Ax^ . 
Two possible approaches can be followed: in the first approach, the 
model parameters are identified in the two different states, the undamaged 
(x^) and the damaged (x^) states; the variation in the parameters {x^ — 
x^)/x^ furnishes the degree of damage. In the second approach, using 
a given model of the structure, the modification Ax^ /x^ are determined 
in such a manner as to fit, at best, the experimental modification of the 
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observed quantities IS.z^ . 

Both problems can be formulated as minimization problems of suitable 
functionals £, which in the absence of errors, is equal to zero for the exact 
values (.)* and are greater than zero for values different from the exact ones 
(.)*: 



£(a;*;z*) = and £(x;z*)>0 for x ^ x* (22) 

£(Aa;*;Az*) = and £(Aa;;Az*)>0 for Ax^^Aa;*. (23) 

Damage identification in a one-dimensional continuum due to open cracks 
is an interesting problem in practice and simple enough for discussing the 
basic aspects. A reduction in stiffness is interpreted as damage. 

In the past, this problem has been tackled as a general identification 
problem: the conditions for spectrum reconstruction have been discussed. 
The variations in the dynamic response due to cracks has been examined in 
numerous articles, a consistent number of which arc specifically directed at 
predicting the location and magnitude of the cracks. Among all these stud- 
ies, little attention has been paid to some peculiar aspects of damage identi- 
fication. It is important to distinguish between two different situations: (a) 
diffused damage, where the problem exhibits a high number of unknowns, 
it is certainly indeterminate and additional conditions must be considered; 
(b) concentrated damage, when damage is located in few sections; in this 
case, damage modifies few stiffness characteristics and the problem has a 
small number of unknowns. Unfortunately, in both cases, the problem has 
always been tackled by assuming all parameters to be unknown. Hence, the 
result is indeterminate and conditions not generally related to the physi- 
cal behaviour have been introduced to make it determinate (Vestroni and 
Capecchi, 1996; Ccrri and Vestroni, 2000; Morassi and Vestroni, 2008). 

3.1 Damage Identification in a Steel Arcii 

An experimental application of a damage identification procedure in a 
steel arch with concentrated damage, consisting of a notch on a given cross- 
section is presented. The damage parameters to be identified are the loca- 
tion s and stiffness constant k of a spring equivalent to the notch for its 
effects on the local increase in deformability. Four damage configurations 
were considered, characterized by the same location (s=0.3) with increasing 
intensity (D1-D4); the expected values of fc are reported in the first column 
of Table 6. The procedure is based on the comparison between numeri- 
cal and experimental variations of natural frequencies from undamaged to 
damaged states; hence the reference (undamaged) state has to be known. 
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In the solution to the inverse problem, it is important to be aware of 
the necessary number of measured quantities providing a unique solution. 
This goal can be reached by studying some aspects of the direct problem 
through a model, analytical or numerical, of the arch. In particular, for a 
damaged configuration characterized by the values of k and s, the model 
provides the values of the natural frequencies w^. On the contrary, when 
uJi is known for the damaged arch, for each possible damage position s, 
a stiffness ki{s) exists, which corresponds to a value of the ith natural 
frequency equal to LOi. Bearing in mind more complex situations, a FE 
model is adopted to solve the eigenvalue problem of the arch. Therefore, 
by taking into account all the possible positions of the damage, a curve 
ki{sj) can be obtained. Figure 13 represents the curves ki{sj) for the first 
five frequencies of vibration, obtained by using as damage parameters s= 
0.3 and k= 20. This figure can be used to examine the uniqueness of the 
solution to the inverse problem. Curves ki{sj) obtained for different tOi cross 
at the abscissa where the damage is located, providing the solution to the 
inverse problem. In fact, only at that point the values of k and s satisfy all 
the assigned frequencies. In the case of simply supported straight beams, 
the curves corresponding to the first and the second frequencies, ki{sj) 
and k2{sj) cross only once, providing the solution to the inverse problem by 
using only two frequencies (Figure 13 b). Regarding the arch presented here, 
however, each pair of curves shows more than one intersection, and thus the 
solution is undetermined when only two frequencies are known (Figure 13a). 
This is due to the wavelengths of curves ki{sj), which present at least one 
node along the length of the arch, one more than the simply supported 
straight beam. Thus, at least three different frequencies are necessary to 
uniquely determine the two damage parameters. 
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Figure 13. Curves ki{sj) for the first five modes of the arch a) and three 
modes of a beam b); s= 0.30, k= 20. 
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It should also be observed that the axis of the figures reported involves 
only one half of the beam because of the symmetry of the problem, which 
does not enable one to discern between two symmetric damage positions. 
This is a drawback of the frequency approach, which could be overcome by 
adding some information on the mode shapes or on anti-resonant data, as 
shown in (Dilena and Morassi, 2010). 

In real cases, when errors are present and more measurements than those 
strictly required are available, it is convenient to formulate the problem as 
a minimization problem. The optimal estimate of the damage parameters 
k and s is obtained by minimizing the objective function: 



G(fc, s) 



E 



AuJi{k,s) Awe 



w,- 



LO 



(24) 



This is defined as the sum of the squares of the differences between the nu- 
merical Auji(k, s) and experimental Acuei variations of frequencies between 
the undamaged and the damaged states, normalized with respect to the 
frequencies of the undamaged arch, ujY and w^. 

As the numerical values Auji(k,s) are provided by a FE model of the 
arch, it is convenient to obtain the damage parameters k and s in two phases, 
by successively seeking two distinct minima. For each possible discrete 
damage position Sj in the FE model, the minimisation of function (24) with 
respect to k provides the function: 



G{sj) = minG(fc,Sj). 



(25) 



The solution to the inverse problem is then given by the minimum of G{sj) 
over s. If this function exhibits one global minimum, the solution to the 
inverse problem exists and is unique. 



3E-005 



2E-005 



>^ 



lE-005 - 




Figure 14. Objective functions using frequencies 1-5 a) and 1,2,5 b) for 
the case D2. 
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As observed in the direct problem, the minimum number of frequencies 
necessary to have a unique solution is three. This is further confirmed by 
Figure 14, which reports the objective functions built with five (a) and three 
frequencies (b), and in particular frequencies 1-2-5, related to the experi- 
mental case of damage D2. For brevity, only case D2 is reported. For both 
functions (a) and (b), the global minimum is unique, notwithstanding the 
fact that other local minima appear, due to close multiple intersections in 
the curves ki(sj). The use of more than three frequencies does not necessar- 
ily provide better results, as is shown in Table 6, referring to the objective 
function with five frequencies and Table 7 for three frequencies. These ta- 
bles report the values of the spring stiffness and damage positions obtained 
at the global minimum of the objective function for the four damage config- 
urations considered. Both objective functions provide notable errors only in 
the weakest damage scenario (Dl); in all other damage scenarios satisfactory 
results are obtained. The error strongly decreases with increasing damage, 
where systematic errors are less important. However, case (b) provides, in 
general, smaller errors due to the exclusion of outliers (third frequency) or 
frequencies having a low sensitivity to damage in the considered position 
(fourth frequency). 

Table 6. Expected and identified stiffness, notch depths S, damage locations 
and errors (a, i=l-5) 





k 


hd 


% error 


(5 [mm] 


% error 


s 


% error 


Dl 


570 


1161 


103.7 


0.6 


44.3 


0.74 


3.6 


D2 


205 


249 


21.7 


1.8 


11.1 


0.72 


0.8 


D3 


91 


98 


8.2 


2.9 


3.3 


0.71 


0.0 


D4 


40 


36 


10.7 


4.1 


3.3 


0.70 


1.4 



Table 7. Identified stiffness, notch depths, damage locations 5 and related 
errors using frequencies 1,2,5 (b) 





hd 


% error 


(5 [mm] 


% error 


s 


% error 


Dl 


527 


7.5 


1.1 


6.2 


0.65 


8.5 


D2 


205 


0.2 


2.0 


0.1 


0.71 


0.0 


D3 


89 


1.4 


3.0 


0.6 


0.72 


0.8 


D4 


36 


10.1 


4.1 


3.1 


0.71 


0.0 



In the weakest damage scenario (Dl), the frequency variations arc small, 
and the accuracy of the model becomes a key factor. In fact, in this case, 
the addition of an axial spring modelling the axial stiffness reduction helps 
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lE-006 




lE-009 



Figure 15. Objective fvmctions obtained from the model with (bold) and 
without (thin) reduction of axial stiffness for the case Dl. 



in identifying the right notch location. This is shown in Figure 15, which 
shows the objective functions obtained when using three frequencies and 
wether the reduction in the axial stiffness induced by damage is considered 
(bold line) or not. More details can be found in (Pau et al., 2010). 



4 Conclusions 

There is a growing interest in the experimental assessment of the models 
of existing structures, to better understand and predict their behaviour 
and, more importantly, to know how their evolution may be used within 
a health monitoring strategy. Within the inverse problems, attention has 
been focused here only on parametric model identification, where available 
experimental data are used to obtain an optimal estimate of the model 
parameters. In particular two categories of models are considered: modal 
models and finite element models. In the first the unknowns to identify are 
the modal parameters (frequencies, eigenvector components, damping), in 
the second the parameters are the mechanical characteristics of the model. 
Parametric identification is a typical inverse problem, and as such it 
is generally a difhcult task; however, modal model identification is easier 
and this explains its wider use and the implementation of the procedure 
in commercial codes. The most important methods are summarized and 
case-studies have been presented for two different scenarios, when input 
and output are measured and when output-only is measured; this latter 
case is rapidly spreading, because nowadays the recording of ambient vibra- 
tions is feasible even for large structures. The results presented for large 
and complex structures, such as historical buildings, arc a demonstration 
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that experimental modal analysis is reliable enough to furnish an experi- 
mental evaluation of the modal parameters and a dynamic characterization 
of the structure; the evolution of the parameters makes it possible to find 
out the modification of the structural characteristics and to detect damage. 
Modal parameters are still the experimental quantities most frequently used 
in the successive identification of more refined models. The identification of 
the physical parameters of the finite element models of a structure is more 
involved, but these parameters are representative of local structural char- 
acteristics; their evolution is potentially able to give a detailed description 
of damage. The identification of damage in an arch due to an open crack 
has been presented to illustrate the use of FE model updating, based on 
the experimental data, like modal quantities, evaluated in undamaged and 
damaged conditions. The procedure, which takes into account the peculiar- 
ity of this inverse problem, is able to localize damage and to determine its 
intensity, a step more ambitious than damage detection; the results are sat- 
isfactory, but it has been shown how important the accuracy of the model 
and quality of experimental data are. 
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Abstract Matrix methods using mass, damping and stiffness terms 
are widely used in vibration analysis of structure and provide the ba- 
sis for active control of vibrations using state-space methods. How- 
ever modern vibration test procedures provide reliable receptance 
data with usually much greater accuracy than is achievable from sys- 
tem matrices, M, C, K, formed from finite elements. In this article 
procedures are described for pole placement by passive modification 
and active control using measured receptances. The theoretical ba- 
sis of the method is described and experimental implementation is 
explained. 

1 Introduction 

Receptances were first used in the passive modification of structures by 
Duncan (1941) and the pole assigment problem was described as a classical 
inverse problem in active control by Kautsky et al. (1985). In this article the 
use of measured receptances for pole assignment in passive modification and 
active control is described. The considerable advantages of this approach 
are explained. 

In conventional modelling the dynamic behaviour of a structure is de- 
termined from mass, damping and stiffness matrices, M, C, K, usually 
obtained from finite elements. In theory the receptance matrix, which is 
measurable experimentally, is the inverse of the dynamic stiffness matrix, 

His)^Z-\s) (1) 

Z(s) = s^M -h sC -^ K (2) 
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In reality finite element models contain approximations, assumptions and 
errors not present in measured data. The terms in the receptance matrix, 
H(s), are dominated by the eigenvalues (or poles) closest to the frequency 
of excitation, 

H(.)=E(^ + ^) (3) 

, — ■ S — Ak S — A,. 

k=l '^ 

where (pk denotes the fc*'* eigenvector. 

It may be shown, however, that the dynamic stiffness terms, in Z(s), 
are dominated by the poles furthest away - usually the higher modes, most 
affected by discretisation and the least accurate from a finite element model. 
We see that H(s) may be represented very accurately by a truncated set 
of modes whereas Z(s) generally may not be. Furthermore, the problem 
of dynamic-stiffness error cannot be overcome by inverting the accurately 
measured matrix of receptances. Within the limited frequency range of 
a vibration test, and when s = iw, the inversion of H(iw) is not able to 
reproduce Z(ia;) because the effect of the high-frequency modes is negligibly 
small, within the measurement noise (Berman and Flannelly, 1971). 

The receptance terms may be expressed by the ratio of two polynomials, 

where d{Xk) = is the characteristic equation of the original system, with 
eigenvalues Afe, k ^ 1,2,..., 2n. The ij*^^ term in N(s), provides the char- 
acteristic equation of the zeros, nij{^k) = 0, A; = 1, 2, . . . , 2m, m < n. This 
means that the vibration response at coordinate i vanishes when a force is 
applied at coordinate j with frequency ^j.. The assignment of zeros is im- 
portant for delicate instruments etc. that will not tolerate harsh vibration 
environments. 

The polynomial d{s) and the matrix N(s) may be expanded as, 

dis)^f[{{s^X,)is^\l)) (5) 

fc=i 

and 



diag(s — Aj) 

diag(s - A*) 



[ $ $* ]^ (6) 



N(s) = [ $ $* ] adj 
where $ is the modal matrix of eigenvector columns. It follows from equa- 
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tion (6) that rEink(N(s)) = 1 if and only if s = A^; Afe G Aj-; j — 1,2, . . . ,n, 

(11 n \ 

n (>^k-\i)X{{\k-x*)\'Pi (7) 

The dynamic stiffness equation, 

Z(s)x(s) = f(,s) (8) 

is a 'force' equation. Every term in x(s), the displacement or state vec- 
tor, must be present for the equation to be complete. This is the reason 
why, in state-space control problems, it is necessary to use an observer (or 
equivalent) to estimate the unmeasured states. In industrial-scale problems 
the number of coordinates can be many thousands, or even millions, and 
therefore it is necessary to use model reduction methods to condense the 
problem to an acceptable size. On the other hand, the receptance equation, 
expressed as 

H(s)f(s) = x(,s) (9) 

is a 'displacement' equation. For each row of the matrix equation to be 
complete it is only necessary to know the applied forces, usually at a small 
number of coordinates. Therefore for every sensor location, the measured 
terms in x(s), one only needs to know the non-zero terms in f(s), which are 
usually measured in an experiment. Therefore, there is no requirement for 
an observer or for model reduction when using the receptance method. 

In practical problems we may be interested to modify the spectrum (the 
natural frequencies and damping ratios) of a system, usually to avoid res- 
onances with excitation frequencies. This may be achieved by modifying 
the system, either passively by adding mass, damping or stiffness terms, 
or by active control. The advantage of passive modification is that the 
modified system is guaranteed to remain stable. However, as will be seen 
later, practical modification requires difficult measurements, typically rota- 
tional receptances. The advantage of active control is that it offers much 
greater freedom in the 'form' of the modification, but this is achieved at the 
expense of 'compensation' to ensure stability. The simplest modification is 
the unit-rank modification and a straightforward solution is available by the 
Sherman-Morrison formula, 

H(.) = H(.) - H(.)u(.)v^(.)H(.) 

^' ^' l+v^(s)H(s)u(s) ^^' 

where H(s) is the receptance matrix of the modified system. The charac- 
teristic equation is, 

l + v^(^fe)H(Aife)u(A.fc)=0 (11) 
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where ^k, k — 1,2, . . . ,2n are the eigenvalues of the modified system. 



2 Passive Modification 

Probably the simplest passive modification is a spring eonneeting a single 
coordinate to earth. In this case, 

u ^ V = Gp (12) 

where Sp is the p*'' unit vector and p denotes the spring attachment coor- 
dinate. The characteristic equation becomes, 

1 + fcejH(/Xfc)ep = (13) 

The spring k that will exactly assign the eigenvalue /ifc is given by, 

-j^hppi^ik) (14) 



where hpp^fjik) = ejH(/ife)ep, the p*'* diagonal receptance term. A single 
added spring can assign only one eigenvalue exactly and the other eigenval- 
ues are then given by solving the characteristic equation (13). The modified 
stiffness matrix is given by K -|- kepel . 

Example 1. We consider a conservative lumped-mass, five degree of free- 
dom system, where, 

2 -1 
-1 2 -1 
K= -13-1 ; M = I 

-1 2 -1 
-1 2 

The eigenvalues and eigenvectors are given by, 

0.7203 

1 
A = 1.5202 

1.7321 

2.0421 



$ 



0.3578 0.5 

0.5299 0.5 

0.4271 0.0 

0.5299 -0.5 

0.3578 -0.5 



0.5765 -0.5 0.1993 

-0.1793 0.5 -0.4325 

-0.5207 0.0 0.7392 

-0.1793 -0.5 -0.4325 

0.5765 0.5 0.1993 
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It is found from equation (14) that a spring of stiffness k = 1.19 at 
coordinate 3 will assign an eigenvalue at 1.6 rad/s. In Figure 1 we see that 
the intersections of the horizontal line at — 1/fc with /133 (denoted by the 
dotted line) coincide with the peaks of ft.33. The eigenvalues at 1.0 and 
1.7321 rad/s are uncontrollable by the added spring, which is attached at 
a node, and therefore remain unchanged. They are also unobservable and 
connot be seen in Figure 1. 
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Figure 1: Unmodified and modified reccptanccs 



For further detailed information on the assignment of poles, zeros and 
vibration nodes by rank-1 passive modification the reader is referred to 
Mottershcad and Ram (2006), Tehrani et al. (2006) and Mottershead et al. 
(2001). 



2.1 Passive Modifications of Higher Rank 

Most practical passive modifications are not rank-1, but of considerably 
higher rank. A practical example might be to connect a beam between two 
points on a structure. If the connection points are unconstrained then there 
are generally six degrees of freedom, three translations and three rotations, 
that must be connected. This usually requires a modification of at least 
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rank-6. Likewise, if the modification takes the form of a large mass, there 
will be both translational and rotational inertia amounting to six degrees of 
freedom to be connected. The modified receptance matrix may be written 
as (Kyprianou et al., 2005), 

H(s) = (I + H(s)AZ(s))-iH(s) (15) 

where 

AZ(s) = s^AM + sAC + AK (16) 

denotes the dynamic stiffness of the modification. The characteristic equa- 
tion of the poles becomes, 

det(I + H(Ai,)AZ(Aifc)) = (17) 

What appears, from the mathematical formulation, to be an essentially 
straightforward problem of pole assignment is in fact very difficult in prac- 
tice. The difficulty lies in the measurement of the receptance matrix, H(s). 
In fact, receptances are not measured directly but are estimated from mul- 
tiple sets of accelerometer and force-sensor measurements. The estimation 
may be arranged to minimise cither the noise on the force signal or the noise 
on the output signal. The latter, known as the Hi estimate is probably the 
most widely used. Translational accelerometers and actuators have been in 
use for many decades and there is no difficulty in estimating point- and cross- 
receptances between translational inputs and outputs. The problem arises 
with the need to estimate those receptances that involve a rotational input 
or output. For example to estimate a rotation- rotation receptance it is nec- 
essary to impart a pure moment (without a force) to the structure and also 
to measure the rotational acceleration. Although rotational accelerometers 
are now commercially available it appears that the quality of measurement 
is not as good as conventional translational accelerometers. One way to 
overcome the problem is to excite the structure with forces and moments 
together and then to measure all the resulting translations and rotations; 
this becomes a multiple-input multiple-output problem. Mottershead et al. 
(2005) and Mottershead et al. (2006) used an array of accelerometers on 
a 'T' or 'X'-block attachment and by moving an electromagnetic shaker 
to different points on the attachment in turn, excitation was applied to the 
structure in many degrees of freedom. Of course, the attachment introduced 
a certain amount of additional mass to the system and was itself flexible. 
The arms of the 'T' or the 'X' (shown in Figures 2(a) and 2(b)) need to be 
sufficiently long that a significant moment excitation can be achieved. Also 
the effects of the forces applied at the tips of the arms need to be different 
and this is only possible if the tips are separated by a sufficient distance. 
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/j(2)x,(2) /„(3) /j(3) .v,(3) 



x,(l) 



iw. fid) 



.^o(3) 



■v. (2) 



x„(l) 




(a) T-block (b) X-block 

Figure 2: (a)T-block and (b)X-block forcing and response coordinates 



This requirement leads to an additional complication; that the attachment 
is likely to have natural frequencies of its own within the range of the test. 
This was overcome (Mottershead et al., 2005), (Mottershead ct al., 2006) 
by embedding a finite element model of the attachment into the estimator 
for the matrix of receptances. 



2.2 Modification by an Added Beam or a Large Mass 

The F-structure modified by an added beam to form a portal frame is 
shown in Figure 3. The 'T'-block attachment is shown in Figure 4. The in- 
verse problem was to assign natural frequencies using equation (17). A full 
description may be found in Mottershead et al. (2005). The dynamic stiff- 
ness matrix, AZ(s) was parameterised using the cross-sectional dimensions 
of the added rectangular hollow-section beam, b = breadth, d = depth, t = 
wall thickness. 

The characteristic equations, being high-order multivariable polynomials 
in b, d and t, were solved using the method of Groebner bases. In the cases 
of assigning a single natural frequency, two parameters were held constant 
and solutions for the third (variable) parameter were found. When two 
natural frequencies, or a natural frequency and an antircsonance, were to be 
assigned, then two parameters were varied. Among the multiple solutions 
produced, a value was found in each case corresponding (closely) to the 
parameter value in a beam used in the experiment to complete the portal 
frame. Typical results, presented in Table 1, show complex numbers, with 
small imaginary parts, returned for the variable parameters. This is because 
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Figure 3: Portal frame structure with a removable beam 




Figure 4: T-block experiment 
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the F-structurc contains a small amount of damping, which needs to be 
compensated in order to place the poles (natural frequencies) exactly on 
the imaginary axis of the complex s-plane. 

Table 1: Assigning (i) a natural frequency at 44.25Hz, {ii)two natural fre- 
quencies at 44.25Hz and 89.57Hz and (iii) a natural frequency at 44.25Hz 
and an antiresonance at 89.97Hz. 

Parameters Fixed parameters Solution 
~(T) t {b,d) = (0.1,0.05) 0.00406 + 0.00627i 
{ii) {b,d) i = 0.004 0.0924-0.0008i,0.0516+0.0022i 

(m) (bj) t == 0.004 0.0924-0.0008i,0.0516+0.0022i 

In a different experiment (Mottershead et al., 2006) the matrix of re- 
ceptances from a Westland Helicopters Lynx Mark 7 was estimated using 
an 'X'-block attachment. The effect of adding a large overhanging mass 
(equivalent to the tail rotor gearbox and hub) was then determined. The 
added mass was equal to the mass of the complete tailcone and therefore 
comprised a very significant modification. The problem, of course, was a for- 
ward one rather than an inverse problem. Figures 5 and 6 show the tailcone 
set-up including the use of the 'X'-attachment (Figure 5) and the added 
mass (Figure 6). The measurement of rotational receptances requires the 
use of special techniques not available from commercial modal test systems, 
which can be very time consuming, and a very high level of experimental 
expertise to obtain good quality results. However, as we see in the follow- 
ing section, the requirement of specialist techniques and expertise may be 
avoided completely when poles and zeros are assigned by active vibration 
control. 

3 Active Control 

The advantages to be had by pole placement using active vibration control 
are very considerable. Firstly there is no restriction to the form of the 
modification. Whereas passive modifications must be symmetric, positive- 
definite and conform to a certain pattern of non-zero matrix terms, no such 
restrictions apply in active control. Secondly, there is no requirement for 
rotational receptances. Finally, the number of eigenvalues to be assigned 
may be much greater than the number of actuators. Whereas in passive 
modification the number of eigenvalues to be assigned cannot exceed the 
rank of the modification, in active control a single actuator can in principle 
assign all 2n poles by a rank-1 modification. 
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Figure 5: X-block attachment to the tailcone 




Figure 6: Lynx Mark 7 helicopter tailcone with the added mass 



Eigenvalue Assignment Problems in Vibration 



189 



Wc begin by considering the case of single-input state feedback, and 
write the second order matrix equation as, 



(s^M + sC + K)x = hu{s) + p (s) 



(18) 



or 



where 



M 
M C 



M 
K 



u{s) 



sx 

X 



(sf+g) X 



u{s) + p{s) (19) 



(20) 



is the control input and p (s) is an external disturbance. 

Orthogonality conditions for the quadratic pencil were developed by 
Fawzy and Bishop (1976) and more recently by Datta et al. (1997), 






and 



^l [2A,M + C] ^fe = 1 
Vl [Xlm - K] ^fe = Afc 
By combining equations (18) and (20) we see that. 



K + bgM x(s) = p(.s) 



(21) 
(22) 

(23) 
(24) 

(25) 



amounts to a rank-1 modification to the dynamic stiffness matrix. 

Now, after omitting the external disturbance p(s), setting s =^ Afc and 
prcmultiplying equation (19) by {Xk^p^ fj), 



( Afc<Pfc fl ) f Afc 



M 
M C 



M 
K 



Xkfk 
Pk 



^^b) g^+AfefM^fc 



(26) 



The right-hand-side vanishes whenever Up^\)) — 0, the uncontrollahility 
condition, or (g -f Afcf J (/3fc = 0, the unobservability condition. We ob- 
serve that under either of these conditions the eigenvalue A^. remains un- 
changed by control action. 
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3.1 Single-Input State Feedback by the Receptance Method 

Application of the Sherman-Morrison formula to the rank-1 modification 
described in equation (25) leads to the modified receptance matrix (Ram 
and Mottershead, 2007), 



H(s) = H(s) 



H(s)b(g + .sf)'H(s) 



l + (g + sf)' H(s)b 
The closed-loop characteristic equation may be written as, 
l + (g + /ifcf)^H(Aifc)b = 



(27) 



(28) 



dil^k) + (g + /ifef) N(/ifc)b = (29) 

From equations (29) and (7) we sec that the system is uncontrollable (un- 
obscrvable) when N(/ifc)b = Q ((g -j- /i^f) N(/ife) = 0); /j-k = Xk- 

The characteristic equations for each assigned eigenvalue may now be 
assembled together in matrix form as, 



T 
T 



^2n Mll"2ra 



-1 



V -1/ 



(30) 



or. 



G 



where, 



?)^ 


/ -1 \ 

-1 


v-iy 


fe = H(A 


^k)h 



(31) 



(32) 



It may be seen that G is invertible when the closed-loop eigenvalues to be 
assigned are simple and controllable. In principle the complete spectrum 
of 2n eigenvalues may be assigned using a single actuator (i.e., a single 
non-zero term in b). 



Example 2. We consider the system described by. 



M 



■ 1 


■ 


; c 


[ u 


2 





1 -1 
-1 1 
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The poles to be assigned are, 

Ml = -1 + lOi; /i2 = -1 - lOi; Ms = -2; Mi = -3 

Then, wc find that G is given by, 

0.0102 + 0.00211 -0.0097 + 0.00201 -0.0110-0.10431 

0.0102-0.00211 -0.0097-0.00201 -0.0110 + 0.10431 

0.1236 0.2360 -0.2472 

0.0714 0.1111 -0.2143 



-0.0101 - 0.09891 
-0.0101 + 0.09891 

-0.4719 

-0.3333 



The control gains arc found to be. 



68.8750 
30.3750 



f = 



-62.6750 
68.1750 



It is readily demonstrated that the desired eigenvalues are assigned, by 
solving the eigenvalue problem. 



Mfe 





M 



M 
C + hf 



M 






K + bg^ 










The method described above was extended by Ram et al. (2009) to 
include the effects of time delay such as occur in analogue to digital (A 
to D) and D to A conversions, use of digital filters, other digital processes 
and slow response of actuators etc. Mottershead et al. (2008) considered 
the problem of pole-zero placement using output feedback with collocated 
actuators and sensors. This results in a higher rank modification, greater 
than rank-1. 



3.2 Partial Pole Placement 

The purpose of partial pole placement is to assign certain chosen poles 
(usually the lower frequency poles) while other poles (usually at higher 
frequencies) remain unchanged. The usual justification given for doing this 
is the avoidance of 'spillover'. Generally, when a fewer number of poles than 
the full 2n are assigned the remaining ones are changed and may then have 
positive real parts causing the system to be unstable. It is this change to 
the unassigned poles that is referred to as 'spillover'. Therefore by avoiding 
spillover completely using partial pole placement, there is no possibility 
that the closed-loop system will become unstable. We should qualify this 
by saying that partial pole placement should be applied robustly to minimise 
the effects of uncertainty in the control gains and receptance measurements. 

There are two ways of formulating the partial pole placement problem 
using either the conditions of unobscrvability (Datta et al., 1997) or uncon- 
trollability (Tehrani et al.. 2010). We begin by partitioning the spectrum 
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and the modes. The poles to be assigned are denoted by, 



Ai 



AT 



diag(^j 



diag(/i* 



1,2, 



and those to be retained by, 



A, 



AS 



diag(Aj) 



; j = m + l,m + 2, . 



ri 



rm 



diag(A*; 
The corresponding modes are denoted by, 

[ *1 *t ] = [ V-l • • • V'r, 
[ *2 *2 ] = [ V-rn+l ' ' ' i^n V'™+1 ' ' ' i^n] 

Datta et al. (1997) showed that for an arbitrary vector /3 and, 

Ai 



(33) 

(34) 

(35) 
(36) 



that the poles 
condition, (g^ 



M[ *i 

A2 



^l 



Al 



/?; g = K[ *! ^l]p (37) 



a; 



remained unchanged. From the unobservability 



Afcf 1 ^Afe = 0, and using equations (37), 



Al 



[ *i ^ifMijkXk+i'ii nf^^k]^0 



(38) 

Since Afe G A2; ^fc G ^2 it becomes clear (from orthogonality) that the 
bracketed term on the left-hand-side is the null vector. The unobservability 
condition is fulfilled so that ^^ = Afc remains unchanged irrespective of the 
choice of /3. An explicit formula for /? is proveded by Datta et al. (1997). 

The advantage of this approach is that there is no need to know the 
spectrum and modes of the unchanged eigenvalues. However, use of the or- 
thogonality condition has the important practical implication that a sensor 
must be located at every degree of freedom of the system. It is not necessary 
for there to be an actuator at every degree of freedom. The vector /? may 
be chosen freely to assign the closed loop poles yitfe, /i^, k — 1, . . . ,m while 
independently the other open- loop poles ^fc, /i^ = Afc, A^, k = m + 1, . . . ,n 
are retained. 

The other way to achieve partial pole placement is by using the uncon- 
troUability condition, ■0jb = 0. We now express b in the form. 



b(.5) = bl + 



b2 

s 



(39) 
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since b must be perpendicular to both the real and imaginary parts of -0^ . 
Of course there are other acceptable forms of b apart from equation (39) . 
Then for the retained eigenvalues b(s) is sought that satisfies the expression, 



[^l 



or, 



A^-^^^ 



bi 
b2 



bi 
b2 







V = null [ 



*J 



A2-'*T 



null 



Va 
3 



{^D ^{K-,Hl) 



{^D ^{K-,Hi) 



(40) 



(41) 



(42) 



and a is an arbitrarily chosen vector. The gains g and f may then be 
selected to assign the closed- loop poles fik, fJ-l, k = 1, . . . ,m. 

The advantages of this approach are firstly that the spectrum may be 
separated into assigned and retained eigenvalues without using the orthog- 
onality conditions. This means that there is no requirement to know or to 
evaluate the system matrices M, C, K, which is entirely consistent with the 
receptance-based approach. Secondly it is not necessary to place sensors at 
all the degrees of freedom. It is however necessary to know the retained 
modes at the chosen sensor coordinates, but this is not usually a major 
problem when using a modern modal test system. In principle, the number 
of actuators required is one greater than the number of retained modes. 

Example 3. We consider the system described by. 



M 



K 



10 



20 



12 



2.3 

-1 



-1 

2.2 

-1.2 



-1.2 

2.7 

-1.5 



-1.5 
1.5 



40 


-30 






-30 


60 


-30 






-30 


90 


-30 






-30 


30 



The open-loop poles are, 

Ai,2 = -0.0108 ±0.87361 
A3^4 = -0.809 ± 1.67661 
As^e = -0.1336 ±2.52801 
At's = -0.3980 ± 4.02081 



We wish to assign the first two pairs of poles. 
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^12 = -0.03 ±li 
^3^4 = -0.1 ± 2i 

while the remaining poles are unchanged. 

The eigenvectors of the unassigncd poles arc found to be, 



^'. 



5,6 



-0.0941 T0.2578i 

-0.0829 T0.1727i 

0.1056 ±0.2807i 

-0.0738 T0.1775i 



V" 



7,8 



-0.0535 ±0.2107i 
0.0220 T0.0613i 

-0.0033 ± 0.0077i 
0.0006 T 0.00141 



Then, 



null 



5R 



V 

0.1996 
0.7146 
0.2723 

-0.5712 
0.1143 
0.1512 
0.0898 

-0.0719 



T 



^i, 



At 



At / 

-0.0023 
0.0394 
0.3580 
0.5065 
0.1676 
0.6275 
0.3726 

-0.2303 



/ 



-0.0649 

-0.2611 

-0.4771 

-0.4113 

0.0730 

0.3787 

0.5493 

0.2834 



0.0436 
0.1725 
0.3078 
0.2447 
-0.0610 
-0.2347 
0.2839 
0.8212 



We choose the first column from the null-space so that. 



0.1996 

0.7146 

0.2723 

-0.5712 



0.1143 

0.1512 

0.0898 

-0.0719 



This choice of b ensures that the fifth-eighth eigenvalues are retained. Now, 
using this b the gains g and f are determined in exactly the same way as 
before. 



10.1513 




4.4973 


12.1105 


; f = 


5.4989 


8.1401 


6.4880 


7.2688 




11.2309 
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3.3 Robust Pole Placement 

Mottershead et al. (2009) determined the sensitivity of the poles to small 
changes in the control gains, 






-efN(Mfe)b 



ds U=Mfc 



+ (g+M^fr^is=p.b+FN(Mfe)b 

-MjefN(^fc)b 

if U=^, + (g + M.fr^|.=^,b + FN(^fe)b 



(43) 
(44) 



and the sensitivity of the poles to small measurement errors was given by 
Tehrani ct al. (2010), 



djXk 
dhm 



-(g + Aifcf)^epe^b 



(g + Mfcf) 



T I 3H| 



dH^ 



-p Qg \s=tik^q 



=^,eT]h + fn(pk)h 



(45) 



We see from equations (43) and (44) that the eigenvalues are insensitive 
to the gains when N(/^fc)b — 0, which is once again the uncontroUability 
condition. 

For robust pole placement the sensitivity-matrix terms should be min- 
imised, 



dg. 






dh„„ 



(46) 



3.4 Practical Aspects 

In this section we consider the practical application of active pole place- 
ment using electromagnetic inertial actuators and accelememeters. The 
implementation is achieved by using the dSPACE system with MATLAB- 
Simulink. In the theoretical sections we considered displacement and veloc- 
ity feedback. However, the vibration response is measured using aecelcrom- 
eters and in order to obtain the displacement response double integration 
must be carried out digitally using dSPACE. Digital processing introduces 
control delays and errors. Therefore, instead of displacement and veloc- 
ity feedback we now use velocity and acceleration, thereby eliminating one 
integration and reducing the integration errors. The acceleration/voltage 
transfer function for such an actuator was given by Preumont (2002), 



ClS 



V{s) Trips'^ + (Cp + C1C2) s + Kp 



(47) 



196 



J.E. Mottershead, M.G. Tehran! and Y.M. Ram 



where rup is the reaction mass and Cp and Kp denote the damping and 
stiffness of the support attaching the reaction mass to the actuator casing. 
The terms ci and C2 arc electromagnetic constants. We see that use of the 
inertial actuator introduces an additional pole for each actuator used. 

One very significant advantage of the receptancc approach is that the 
use of an actuator model, such as equation(47), becomes unnecessary if the 
actuator is included in the measured receptance. Strictly the receptance 
is a displacement/force transfer function, so from now on wc use the term 
'receptance' very loosely as in output/input transfer function, in this case 
specifically as acceleration/voltage. Of course the accelerometers (which 
may also have significant dynamics, may also be included within this recep- 
tance. One of the problems of using an inertial actuator is that the actuator 
poles may be destabilised as a result of spillover. Therefore it is necessary 
to place a sensor on the reaction mass to form a nested feedback loop as 
shown in Figure 7. The reaction mass is not accessible in all inertial actua- 
tors. In the experiment described in this section wc used the Data Physics 
IV40 actuator. 

The experiment consists of a short cantilever beam with added mass at 
the tip sufficient to cause the node of the second mode to move to the tip, 
where the actuator is located as shown in Figure 8. Two accelerometers were 
used, one at the tip of the beam and the other on the actuator reaction mass. 
Since the actuator is placed at the node, the second mode is uncontrollable 
and remain unaffected by control action. This is an example of partial pole 
placement. Our objective is to place the actuator pole and the first pole of 
the beam while the second pole of the beam remains unchanged. 



v^^O-^<y-^ 



H(s) 



<> 



< a ♦ 



4 




1 



Figure 7: Nested controller 



The open-loop poles for the beam and actuator are. 
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Figure 8: Beam experiment 



Abi.2--5±419i 
Aai,2--8±205i 

The closed-loop prescribed values are, 



Mbl,2 ■ 
Mai, 2 



-8 ± 440i 
-12±170i 



Four equations of the following form must be solved, 
l+(Mfcg + Aifcf)^H(/ifc)b = 0; fc = l:4 



Receptance H(s) is obtained from the measured H(ia;), itself obtained 
by curve- fitting using the PolyMAX routine (Peeters et al., 2004), 




LR 



UR 



(48) 



where LR and UR are the upper and lower residuals representing the out 
of range modes. The measured and curve-fitted receptances are shown in 
Figure 9. The control gains arc found to be, 



-2.9794 
0.1508 



-36.2843 
18.5617 
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The simulated open-loop and closed-loop displacement/voltage receptances 
are shown in Figure 10 where it can be seen that the closed-loop peaks cor- 
respond closely to the imaginary parts of the assigned eigenvalues. Figure 
11 shows the measured actuator receptancc in a full experimental imple- 
mentation. The same actuator was used to provide the modal-test input as 
well as producing the control force. A programme of further experimental 
research is presently being undertaken and will be reported in due course. 

4 Conclusions 

The receptancc approach to pole placement in passive structural modifica- 
tion and active vibration control has been described and explained. The 
method is based entirely upon data from standard vibration tests and does 
not depend upon known or evaluated system matrices M, C, K. The sys- 
tem equations are expressed in the units of displacement, whereas force 
equations are formed when using the conventional M, C, K method. This 
eliminates the need for model reduction and the use of an observer to esti- 
mate unmeasured states. Active control offers much greater flexibility than 
passive modification in pole assignment. In principle, all 2n poles may be 
assigned actively using a single actuator. A distinct advantage in practical 
implementation is that elements of the control system, typically actuators, 
filters and sensors, which must be modelled by the conventional matrix 
method, may be included in the receptancc measurement. 
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Abstract In this set of notes we consider inverse problems in biome- 
chanical imaging. We briefly describe the clinical relevance of these 
problems and how the measured data is acquired. We focus on two 
distinct strategies for solving these problems. These include a direct 
approach that is fast but relies on the availability of complete inte- 
rior displacement measurements. The other is an iterative approach 
that is computationally intensive but is able to handle incomplete 
interior data and is robust to noise. We present some recent re- 
sults obtained using these approaches and also comment on future 
directions of research. 



1 Introduction 

Certain types of diseases lead to changes in the microstructural organiza- 
tion of tissue. Altered microstructure in turn leads to altered macroscopic 
tissue properties, which are often easier to infer and quantify than micro- 
scopic properties. Thus the measurement of macroscopic properties offers a 
window into tissue microstructure and health. Consider, for example, the 
progression of breast cancer during which cancerous cells in the milk duct 
activate fibroblasts which are responsible for enhanced recruitment of col- 
lagen (Kass et al., 2007). At the macroscopic level the enhanced collagen 
manifests itself in the form of a tumor with increased stiffness. By palpating 
the tissue in and around the tumor, one may then discern that it is stiff and 
hence infer that it may be cancerous. Similar associations may be developed 
for the cirrhosis of liver and for atherosclerosis in arteries. In biomechani- 
cal imaging (BMI) we aim to utilize this association between macroscopic 
mechanical properties of tissue and its health by generating images of the 
mechanical properties and using these to infer tissue pathology. 
Broadly speaking BMI involves the following steps: 



G. M. L. Gladwell et al. (eds.), Dynamical Inverse Problems: Theory and Application 
©CISM, Udine2011 



204 A.A. Oberai and P.E. Barbone 

1. The tissue is continuously imaged using a standard (phase sensitive) 
imaging technique hke ultrasound or MRI. 

2. It is deformed while it is being imaged. The deformation may be 
applied externally or it may be produced internally (by the pressure 
changes during a cardiac cycle, for example). Further, it may have a 
desired time dependence (quasi-static, time-harmonic, impulsive, etc.) 
and a desired amplitude (infinitesimal strain versus finite strain). 

3. The images of the tissue during its deformation are used to determine 
the displacement field within the tissue. For example, when ultra- 
sound is used to image quasi-static deformation of tissue, successive 
ultrasound images are registered using cross-correlation in order to 
determine the intervening displacement field. 

4. Using the displacement field in conjunction with an appropriate me- 
chanical model, an inverse problem is solved to determine the spatial 
distribution of the material parameters. 

This set of notes is concerned with the last step in BMI. That is solving 
the inverse problem of determining the spatial distribution of mechanical 
properties from the knowledge of its displacements. 

1.1 Types of Inverse Problems in BMI 

The inverse problems encountered in BMI may be classified on the fol- 
lowing basis. 

The time-dependence of the excitation. The excitation may be slow 
so that the response is quasi-static. It may also be time-harmonic, or im,- 
pulsive, or periodic as induced by the cardiac cycle. 

The magnitude of the induced strain. The excitation may induce 
small strain (less than 2 %) so that infinitesimal strain elasticity is ap- 
propriate, or it may induce large strains so that finite strains have to be 
modeled. 

The amount of data. In some instances it is possible to measure all 
components of the displacement field everywhere within the imaging vol- 
ume of the tissue. For example, when using MRI in conjunction with a 
time-harmonic excitation all components of the displacement field can be 
measured. In contrast to this in some instances only select components 
of the displacement fields are determined with reasonable accuracy. For 
example when ultrasound is used to determine the displacement field in a 
quasi-static deformation, the component of displacement along the direction 
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of propagation of the ultrasound waves (called the axial direction) is known 
much more accurately than the component perpendicular to this direction 
(called the lateral direction). Sometimes the lateral displacement estimates 
are so noisy that they are discarded while solving the inverse problem. Thus 
we may have inverse problems with complete or incomplete interior displace- 
ment data. 

From the point of view of developing algorithms for solving inverse prob- 
lems in BMI. the final classification, that is the one based on the amount of 
data, is the most relevant. This is because when complete interior data is 
available one may interpret the equations for the balance of linear momen- 
tum of the tissue as partial differential equations (PDEs) for the unknown 
material parameters. Thus the solution of the inverse problem is reduced 
to developing methods for solving these PDEs and one is lead to the so- 
called direct methods which do not involve an iterative solution strategy. 
This approach fails with incomplete interior data because in this case some 
terms in these PDEs are not known and iterative methods are employed. 
When compared with direct methods, iterative methods tend to be more 
computationally intensive. On the other hand they tend to be more ro- 
bust, because unlike the direct methods they do not work directly with the 
noisy measured data and its derivatives. Thus the emphasis in developing 
new iterative methods is to reduce their computational costs, while the em- 
phasis in developing new direct methods is to enhance their stability and 
robustness. 

In the following sections we describe some of our recent work in devel- 
oping direct and iterative methods for solving inverse problems in BMI. 
Several researchers have worked on both these types of methods (Ragha- 
van and Yagle, 1994; Kallel and Bertrand, 1996; Doyley et al., 2000; Sinkus 
et al., 2000; Oberai et al., 2003; McLaughhn and Renzi, 2006). The descrip- 
tion in the following sections is by no means a review of the state of the 
field, but rather a description of two approaches that we believe have the 
potential of working in a clinical setting. For a review of the experimental 
and computational techniques in BMI the interested reader is referred to 
Gao et al. (1996); Parker et al. (1996); Ophir et al. (1999); Greenleaf et al. 
(2003); Parker et al. (2005); Barbone and Oberai (2010). 
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2 A Direct Method Based on the Adjoint Weighted 
Equations 

The equations of balance of linear momentum for an clastic solid may be 
written as 

N 

V.(5]A(")^„) = /, (1) 



where fJ,i{x), ■ • ■ , ij,n{x) are the N scalar material parameters, A'"' (a;) are 
N tensors or vectors obtained from the measured displacements u and / 
is a scalar or a vector field determined completely from the measured dis- 
placement that typically represents the effect of inertia. Note that we view 
(1) as an equation for the material parameters rather than an equation for 
It. 

A large number of linear elastic systems may be written in the form of 
(1). For example: 

1. Compressible linear isotropic elasticity: here N = 2, and the A("'(a;) 
are tensors. The material parameter /zi = /i is the shear modulus and 
/^2 = A is the (first) Lame parameter. Further A^^^ = (Vu + SIu^)/2 
and A^'^'^ = (V • m)1, where u{x) is the measured displacement field, 
and 1 is the identity tensor. 

2. Incompressible linear isotropic elasticity: here N = 2, and the A^"'>{x) 
are tensors. The material parameter /ii = /i is the shear modulus 
and fi2 = —p is the pressure. Further A^^^ = (Vm + Vm-^)/2 and 
^(2) — 1 \Ye note that here we treat the pressure p as an unknown 
even though it is not a material parameter. 

3. Incompressible plane-stress linear isotropic elasticity: Under the plane 
stress hypothesis the pressure may be determined directly from /x and 
It. As a result the only unknown parameter is /i. Which implies 
iV = 1, /ii = M, and A(i) =Vu + Vm^ + 2(V • u)l. 

4. Scalar Helmholtz equation: The case of time-harmonic incompressible 
linear elasticity is the same as case (2) above where both /i„ and ^4'"^ 
are allowed to take on complex values. In addition / — —pixP'u, where 
the density p and the frequency lo are assumed to be known. In cases 
of antiplane deformation, an approximation is commonly made where 
the pressure term is assumed to be small and ignored. In this case a 
single component of u (the out of plane component denoted by u) is 
measured and used to solve for p. For this case we are lead to the 
scalar Helmholtz equation with N = 1, fj,i — fi, and A^^^ = Vm is a 
vector. 
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The system of equations for ^„ implied by (1) has several interesting 
features not found in typical partial differential equations. They are typi- 
cally hyperbolic, however, they do not come with any prescribed boundary 
data. This is because while imaging tissue in vivo it is highly unlikely that 
the material properties on any surface will be known. Also the system of 
equations arc rarely square. Often, there are more equations than unknowns 
in order to compensate for the lack of boundary data. For these reasons 
the solution of these equations using a numerical method, such as the finite 
element method, is challenging. An obvious solution to this problem is to 
consider using a least-squares approach. However we have found that this 
approach yields solutions that tend to smooth out sharp transitions in ma- 
terial properties and, worse, to produce errors that propagate far away from 
the these transitions into the rest of the domain (see Figure 2 for example). 

2.1 Adjoint Weighted Equations (AWE) 

In order to address the difficulties described above we have developed 
an alternative approach, that we refer to as the Adjoint Weighted Equa- 
tions (AWE) (see Barbone et al. (2007); Albocher et al. (2009); Barbone 
et al. (2010)). These equations are motivated by stabilized finite element 
methods, when the stabilization parameter tends to infinity. In addition to 
addressing the difficulties described above, these equations also lead to eas- 
ily verifiable conditions on measured data that lead to well-posed problems. 

The AWE are given by: find /i„ S iS„, such that Vu;™ G Sm, 

b{w„i,H)^l{wm)- (2) 

Where the bilinear form b[-,-) and the linear form l(-) are given by. 

N 

b{w^,fi) = (A('")-Vu;™,V-(^A(")^„)), (3) 

n=l 

liw^) = (A(™)-Vu;™,/). (4) 

In the equations above {w, v) = J„ w* v dx represents the L2 inner product 
for complex- valued functions w and v. 

In many instances more than one measurement is made in order to render 
the problem unique. For example, the tissue may be compressed during one 
measurement and sheared during another. Another alternative is to com- 
press the tissue non-uniformly. Yet, another alternative is to use waves that 
propagate in different directions. In all these cases there are M equations 
of the type of (1) that are satisfied. This is easily accommodated within the 
AWE formulation by simply adding up the bilinear forms corresponding to 
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each deformation. The premise here is that all bilinear forms are positive 
scmi-definitc, thus adding them produces a composite form that is "more" 
positive. The AWE for multiple measurements are also given by (2) where, 

M N 

b{w„,,ti) = ^(^("■"•Vz«™,V-(5]A("'')^„)), (5) 

1=1 rs=l 

AI 
l{Wra) = ^(^("''^-Vli;™, /(')). (6) 

1=1 

where A^''^' are the tensors corresponding to the /th measurement, and /'" 
are the corresponding vectors. 

A viable numerical method is constructed by the Galerkin discretization 
of the AWE formulation using standard finite element function spaces. This 
method is given by: find /ijj; w /i„ £ 5^ C 5„ be such that Mw^ € <S,^ C Sm, 

b{wt.f^")^l{wt)- (7) 

2.2 Properties of the AWE formulation 

In order to prove the well-posedness of the continuous AWE and optimal 
rates of convergence for their discrete counterpart, the following assump- 
tions are made: 

1. The bilinear form defines a natural norm. That is Vuim G Sm, with 

N N M 
7n— 1 n— 1 l — l 

2. Generalized Poincare inequality: There exists a positive constant Cp < 
oo such that Vw„i G Sm, 

N N M 

E E E("'™^ • ^^""'^^nV • A("'')) < c\\w\\\yc > Cp. (9) 

m— 1 n— 1 1=1 

3. Boundedness of the A-norm. That is there exists a finite, positive 
constant so that Vwm S Sm, 

\\w\\a<C\\w\U, (10) 

where || • ||i denotes the H^ norm. 
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Under these conditions we are able to prove that the continuous solution 
to the AWE, denoted by /x, exists and is unique. Further for the Galerkin 
solution /Lt'', we have the following optimal error estimate: 

\\^l-^l''\\A<chP, (11) 

where h denotes the mesh size and p is order of the complete polynomial 
represented by the finite element function spaces. 

We now present some examples that illustrate the performance of the 
AWE based finite element method. 

2.3 Numerical Examples 

Convergence : As the first example we consider a compressible material 
in plane strain configuration. This is represented by case 1 in Section 2. 
The material parameters are the Lame parameters fi and A, and thus in (1) 
N = 2. In order to achieve a unique solution two independent displacement 
fields are required. Corresponding to these we select the strain tensors 



e(i) 



e(2) 



l + x + y -2{x + y + l) 

~2{x + y+l) l + x + y 

7 + x + 2y -2{x + 2y + 5) 

-2{x + 2y + 5) 3 + x + 2y 



(12) 
(13) 



These yield the exact solution fj, — X — e^+^. We solve the problem nu- 
merically on grids ranging from 2x2 elements to 256 x 256 using the AWE 
method with bilinear finite elements. We compute both the L2 norm and 
H^ semi- norm errors and plot them in Figure 1. For both modulus fields 
we observe optimal convergence in the L2 error and better than optimal 
(super-convergence) convergence in the H^ semi-norm. 

Effect of Discontinuity : Next we consider an incompressible plane 
stress problem to test the ability of the AWE method to reproduce steep 
gradients in material properties. This is represented by case 3 in Section 2. 
In this case the shear modulus fj, is the only material parameter, thus N = 1. 
Further, a single measurement is sufficient to uniquely recover /x up to a mul- 
tiplicative parameter. The exact distribution comprises of a discontinuous 
circular inclusion embedded in a homogeneous background. The contrast 
between the inclusion and the background is 5:1 and the problem is posed 
on a unit square. The "measured" displacement field is obtained by solving 
the forward problem on a 100 x 100 mesh with the boundary conditions 
shown in Figure 2. Thereafter the displacement field is down-sampled to a 
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Figure 1. Error as a function of mesh size h in computing the Lame pa- 
rameters /i (left) and A (right). 



50 X 50 mesh and used to extract strain by employing an L2 projection of 
the derivatives on to standard finite element basis functions. These strains 
are then used to reconstruct fj, using both the AWE and least squares (LS) 
formulations with standard bilinear finite elements. The results are shown 
in Figure 2. From this figure we observe that the AWE solution recovers 
the shape as well as the steepness of the inclusion well. From the line plot 
along the diagonal we observe that it recovers the contrast quite well too, 
while the LS solution has significant errors throughout the domain. 
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Figure 2. Plane stress problem with a discontinuous distribution. Left: 
prescribed boundary conditions on the displacement. Center: Contour plot 
of the reconstructed shear modulus using AWE. Right: Reconstructed shear 
modulus along the diagonal for AWE and least-squares (LS) methods. 



Effect of Noise : Next we consider another incompressible plane stress 
problem with a circular inclusion. The set up is exactly the same as above 
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except the contrast is diminished (3:1). We generate the "measured" data 
as in the example above, but now consider three scenarios: with 0%, 3% 
and 10% additive white Gaussian noise in strains. The results arc shown 
in Figure 3. We observe that the reconstruction even with no noise is not 
perfect. This is likely because the assumptions for the optimal convergence 
of fj,'^ are violated for this data. As we add noise the reconstructions get 
worse. However even with 10% we are able to discern the inclusion and 
obtain a rough estimate of the contrast. It is to be noted that this is 
achieved with no regularization. 

Experimental Data : In the final example we consider the reconstruc- 
tion of shear modulus using experimental data. In this case the material is 
made from gelatin and the modulus distribution comprises of a stiff cylin- 
drical inclusion (made with a higher concentration of gelatin) within a softer 
homogeneous background. The diameter of the inclusion is about one-half 
the horizontal extent of the phantom and it is centered about 2/3rd of the 
way up from the bottom edge. The excitation is in the form a radiation 
force applied to one side of the inclusion. This generates a wave that travels 
from left to right and traverses the inclusion. Another excitation is made 
at the other side of the inclusion leading to a wave the travels from right to 
left. The time dependent out-of-plane component of the displacement cor- 
responding to these excitations is measured by cross-correlating successive 
ultrasound images. In processing this data the main dilatational wave is 
filtered out by recognizing that it travels much faster than the shear wave. 
This measurement as well as the techniques used to extract displacement 
data from it are described in Parker (2010). 

The displacement data are Fourier-transformed in time and the scalar 
Helmholtz equation is used as a model for the out-of-plane displacement 
component. This corresponds to case 4 in the discussion in Section 2. As 
a result in (1), N = 1 and M = 2. The force term is non-zero and is 
given by poj'^u, where the density p = lOOOkg/m^ and the frequency u = 
l-K * 333 rad/ s. The AWE method is used in conjunction with a total 
variation regularization to solve for the shear modulus at this frequency. 
Both real and imaginary components of the shear modulus are calculated. 
In Figure 4, we plot the real component of the shear modulus distribution. 
In this picture we can clearly see the inclusion, though its shape is somewhat 
distorted. We observe that the value of the shear modulus is quite close to 
the experimentally measured value of 10** Pa. 
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Figure 3. Reconstructions for the plane stress problem in the presence of 
noise. Top row: 0% added noise. Center row: 3% added noise. Bottom row: 
10% added noise. Left column: contour map of shear modulus reconstructed 
using AWE. Right column: Reconstructed shear modulus along line A — A'. 
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Figure 4. Shear modulus reconstruction using time-harmonic excitation. 
The units on the colorbar arc Pa. 



2.4 Future Work 

The AWE formulation has shown promise in initial tests. There are how- 
ever, several outstanding issues that need to be addressed. These include, 

1. We have quantified the performance of the AWE formulation for com- 
pressible elasticity and for plane stress incompressible elasticity. It is 
yet to be tested for incompressible elastic materials in plane strain 
and in three dimensions. These problems are likely to offer additional 
challenges. For example, in incompressible plane strain elasticity, even 
with two independent displacement measurements, the solution space 
is four dimensional. As a result four additional constraints must be 
imposed. Our initial results indicate that the final solution depends 
on the form of these constraints as well as how these constraints are 
applied. This issue needs to be better understood. 

2. When multiple displacement fields are used as input, they may be 
linearly combined to yield other valid displacement fields within the 
context of linear elasticity. Within the AWE formulation, different 
combinations lead to different results. It is expected that the method 
will benefit from a theory of optimal measurement weighting. 

3. While initial results indicate that AWE is robust to noise, in practice it 
is likely that some form of regularization will be required when dealing 
with measured data (see for example the last case in the previous 
section) . The role of regularization within the AWE formulation and 
its effect on the solution is yet to be worked out. 
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3 A Gradient Based Iterative Method 

As mentioned earlier the displacement data obtained in BMI is often in- 
complete in that only certain components of displacements are determined 
accurately. In addition, the displacement data is noisy. Thus differentiating 
it in order to evaluate the strain tensors A^") is problematic. Motivated 
by this, several researchers have adopted an iterative strategy where the 
equations (1) are not solved directly for the material parameters. 

In the iterative approach a functional that measures the difference be- 
tween the measured and a predicted displacement field is minimized. The 
predicted displacement field is determined by solving the forward elasticity 
problem with an assumed distribution of elastic parameters. This distribu- 
tion is improved upon iteratively, until the difference between the predicted 
and measured displacements is minimal. In the following paragraphs we de- 
scribe our iterative approach to solving the inverse elasticity problem. We 
describe this method in the context of solving the nonlinear inverse elastic- 
ity problem with quasi-static deformation. The interested reader is referred 
to Oberai ct al. (2003); Gokhale ct al. (2008); Oberai et al. (2009) for a 
detailed description. 

The functional to be minimized is of the form 

A/ N 

^M = 5] -||T(«w_) - r(««)|p + Y, ^n[n^]. (14) 

1=1 n=\ 

In the equation above uineas are the measured displacement fields corre- 
sponding to M different excitations, and it*-'' arc the corresponding pre- 
dicted displacement fields obtained by solving the forward nonlinear elas- 
ticity problem with assumed material parameter distributions. Thus these 
displacement fields are implicit functions of the N material parameters 
/x = [^i,--- ,^Ar]. Further T is a diagonal tensor that weights the con- 
tribution from different components of the displacement fields in inverse 
proportion to the error in their measurement. In particular, if a given com- 
ponent is unknown then the corresponding entry in T is 0. The functional 
7?. is a regularization term that ensures smoothness in the reconstructed 
parameters and a„ is the regularization parameter for the material param- 
eter [in. Typically we employ a regularized version of the total variation 
diminishing regularization given by 

Tl\[i\ = / V/3' + |V/i|2da;, (15) 

where the parameter /3 is selected to be small. It ensures the differentiability 
of the regularization function when V/i = 0. 
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In BMI enough interior data is available, so that local minima tend not to 
be a problem. As a result one may use gradient based algorithms to solve the 
minimization problem efficiently. We use the BFGS quasi-Ncwton method 
that builds accurate approximations to the Hessian using gradient infor- 
mation. In evaluating the gradient of the functional n with respect to the 
material parameters, terms of the type J„ Jm'" -v dx appear. In these terms 
t> is a vector that is easily determined, and 5u^ ' represents the variation in 
«(" corresponding to a variation Sjjb in the material properties. Instead of 
explicitly evaluating Su^^' , which would require as many linearized forward 
solves solves as the number of degrees of freedom used to represent /x (often 
around 10''), we directly evaluate the integral /^^ Su'^'''> -vdxhy introducing 
corresponding adjoint fields. This reduces the number of linearized forward 
solves to the number of measured displacement fields (seldom larger than 

4). ^ ^ 

Our iterative algorithm then proceeds as follows: 

1. Guess a material parameter distribution fi. 

2. Using fi solve M nonlinear forward elasticity problems to determine 
the predicted displacements m^'^. 

3. Using these and the measured displacement fields, solve M linearized 
forward problems to determine the adjoint fields. 

4. Using the predicted and measured displacements and the adjoint fields 
determine the functional and the gradient. 

5. If the minimum is not obtained, obtain a new guess for fi from BFGS 
and go to step 2. 

In this algorithm the most expensive step is (2), that is the nonlinear 
forward elasticity solves. In a typical finite deformation elasticity problem 
each solve involves incrementing the load parameter and performing several 
Newton iterations to obtain a converged displacement solution at that value 
of the load parameter. This is continued until the final load configuration 
is obtained. We have noted that the sequence of the loading steps can be 
eliminated if one starts from the fully loaded configuration corresponding to 
the previous guess of the material properties. Thus instead of a continuation 
in the load parameter, we perform a continuation in material properties. 
This can be performed at all iterates of the BFGS algorithm except the 
first, where continuation in the loading parameter has to be used. 

Another challenge in solving the finite deformation elasticity problem is 
the treatment of the incompressibility constraint. Several techniques have 
been proposed to handle this problem. We have found that most of these re- 
quire very small load (or material property) increments in order to converge. 
The best strategy that we have used is a stabilized finite element method 
proposed by Klaas et al. (1999) that allows for equal order interpolations 
and is robust. 
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3.1 Examples 

In this section we present two examples that demonstrate the utiUty of 
our iterative strategy. 

Reconstructions in 3D with Simulated Data We test the perfor- 
mance of our algorithm on simulated data. We create a forward model 
of a incompressible hyperelastic material described by the strain energy 
density function in Gokhale et al. (2008), which is a simplified version of 
the Veronda-Wcstmann strain energy density function. The strain energy 
function contains two parameters; /^, corresponds to the shear modulus of 
the material at small strains, and 7 determines the rate of exponential in- 
crease in stress with strain and thus accounts for the nonlinear behavior of 
the material. The sample comprises of a hard inclusion embedded within 
a softer background. In addition, the inclusion hardens with strain at a 
faster rate than the background. In particular, the shear modulus values 
are fibackground = 1 and ^inclusion = 5, whilc the values for the nonhnear 
parameter are ^background = 1 and ^inclusion = 5 (see Figure 5). 




f 



MU 



•4 
-3 

P 

1 




Gamma 

4 

3 

J2 



Figure 5. Distribution of the shear modulus ^ (left) and the nonlinear 
parameter 7 (right) for the simulated tissue-mimicking phantom. 



This specimen was compressed to about 20% strain and 3% Gaussian 
random noise was added to the displacements. The axial displacement fields 
at about 1% strain and 20% strain were used in the algorithm to recover 
the shear modulus and nonlinear parameter distributions. TVD regulariza- 
tion was used to handle the noise in the displacements. The reconstructed 
distribution of parameters is shown in Figure 6. We observe that for both 
fj, and 7 we recover the distribution quite well. 
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Figure 6. Distribution of the reconstructed shear modulus fi (left) and the 
nonlinear parameter 7 (right) for the simulated tissue-mimicking phantom. 



Reconstructions in 2D With Clinical Data We now describe some 
of our latest in-vivo clinical results. Radiofrequency (RF) echo data was 
acquired for patients with benign and malignant breast lesions. The de- 
tails of the study as well as the associated compliances are described in 
Oberai et al. (2009). A free-hand compression technique was used to induce 
displacements with maximum overall strain in the range of 20%. The dis- 
placements were obtained using a guided-scarch block-matching algorithm 
described in Jiang and Hall (2007). Displacement estimates were obtained 
on a 200 x 200/im grid and were down-sampled by a factor of four for the 
modulus reconstructions. Out of the ten specimens included in this study 
we present the results for one malignant and one benign lesion. 

In the modulus reconstruction we assumed plane stress and utilized two 
deformation fields, one at an overall strain of about 1% and another at a 
strain of about 12%. We determined the shear modulus distribution up to 
a multiplicative parameter using the small deformation displacement field 
and thereafter determined the nonlinear parameter 7 up to an additive 
constant using the large deformation displacement field. We employed the 
exponential strain energy density function described in Gokhale et al. (2008) 
and TVD regularization for both fi and 7. 

The results for the benign lesion (a fibroadenoma) are shown in Figure 7. 
We can clearly observe the lesion as a region of elevated stiffness in the image 
for the shear modulus. We also note that the nonlinear parameter within the 
lesion is slightly elevated compared to the background. The reconstructions 
for the malignant lesion (Infiltrating ductal carcinoma) are shown in Figure 
8. Once again the lesion clearly stands out in the shear modulus image. 
In addition, in this case the value of the nonlinear parameter within the 
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lesion is much higher than the background. These results are indicative 
of the ability of the nonlinear strain hardening parameter in distinguishing 
different types of stiff breast lesions. 




Figure 7. Reconstruction of the shear modulus /i (left) and nonlinear strain 
hardening parameter 7 (right) for a benign lesion. 




Figure 8. Reconstruction of the shear modulus /x (left) and nonlinear strain 
hardening parameter 7 (right) for a malignant lesion. 



3.2 Future Work 

With our current approach we are able to solve clinically relevant inverse 

elasticity problems. However, we are still quite a ways away from our goal 

of being able to do so in three dimensions and in almost real-time. There 

are several outstanding issues that are yet to be dealt with including, 

1. Selection of appropriate strain energy density function. For now we 

have used a modified version of the Veronda-Westmann (VW) strain 

energy density function that reproduces the exponential stress-strain 

behavior observed in most tissues. The original VW function contains 

three parameters (Veronda and Westman, 1970). We have reduced 

these two in order to ameliorate the non-uniqueness of the inverse 
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problem. One of these represents the shear modulus at small strains 
and another represents the degree of stiffening of tissue with increasing 
strain. However recent measurements by O'Hagan and Samani (2009) 
indicate that the VW function even with three parameters may not 
be the best strain energy density function for breast tissue. Other 
functions such as the Ycoh model are better able to represent tissue 
behavior and need to be considered. 

2. Selection of regularization parameter. When using Tikhonov regular- 
ization one may use the L-curve or Morozov discrepancy principle in 
order to estimate the regularization parameter. Our experience sug- 
gests that neither of these strategies provide enough discrimination 
when using TVD regularization. Thus a better strategy to select «„ 
is needed. 

3. Boundary conditions. In order to determine the predicted displace- 
ment fields, a well-posed nonlinear forward problem has to be solved. 
This requires displacement or traction boundary conditions on image 
edges. An obvious choice would be to use the measured displacements, 
but this approach leads to strong imposition of noisy displacement 
data, and is not possible when only one component of the displace- 
ment field is known. We have recently devised an approach that cir- 
cumvents the imposition of BCs for the linear elasticity problem. The 
extension of this to the nonlinear is yet to be worked out. 

4. Uniqueness. The uniqueness of the linear inverse problem in two 
dimensions in plane-strain and plane-stress configurations has been 
worked out. We have recently made some progress in extending these 
results to three dimensions and to the nonlinear problem in two di- 
mensions, where we have considered the Neo-Hookean and modified 
Veronda-Westmann strain energy density functions. Similar results 
for other functions that better model soft tissue behavior have not 
been developed. 

5. Computational time. Even with the adjoint approach and the contin- 
uation in material parameters, a typical nonlinear inverse problem on 
a 100 X 100 grid takes about 20 minutes to solve on a fast quad-core 
computer. It will be of great clinical importance to reduce this time 
down to a minute or two, so as to provide feedback to the radiologist 
while the patient was still in the clinic. There are several strategies 
that can be applied to reduce this time including parallelization of the 
linear solves and the matrix assembly procedure, smoothing strategies 
to accelerate iterative convergence, and multigrid techniques for both 
material parameter and displacement discretizations. 
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4 Conclusions 

In Biomechanical Imaging (BMI) , images of the spatial distribution of me- 
chanical properties of tissue are generated from measurements of its response 
to an excitation. In generating these images BMI relies on the solution of 
an inverse problem of determining mechanical properties of a material from 
measurements of its displacement field(s). This problem differs from other 
"traditional" inverse problems in that significant interior data is available 
and should be utilized. New inverse problems techniques that address this 
issue are therefore necessary to solve it. In this set of notes we have described 
two such approaches. We have also presented some representative results 
obtained from these techniques and described avenues of future research. 
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